Change of Variables in Multiple Integrals


Peter D. Lax 


Dedicated to the memory of Professor Clyde Klipple, who taught me real variables 
by the R. L. Moore method at Texas A & M in 1944. 


1. Let $y = \varphi(x)$ be a differentiable mapping of the interval S = [c, d]. Denote by
T the interval [a, b] with $\varphi(c) = a$, $\varphi(d) = b$. Let f be a continuous function of y.
The change of variable formula says that


$$\int_c^d f(\varphi(x))\,\frac{d\varphi}{dx}\,dx = \int_a^b f(y)\,dy. \tag{1.1}$$


The usual proof uses the fundamental theorem of calculus. Denote by g an
anti-derivative of f:


$$\frac{dg}{dy} = f. \tag{1.2}$$

According to the fundamental theorem of calculus,

$$\int_a^b f(y)\,dy = g(b) - g(a), \tag{1.3}$$


where a and b are the endpoints of the interval T. On the other hand, by the chain
rule the derivative of the composite $g \circ \varphi$ is given by


$$\frac{d}{dx}\,g(\varphi(x)) = \frac{dg}{dy}\,\frac{d\varphi}{dx}.$$


Using (1.2) we see that the x derivative of $g \circ \varphi$ is the integrand on the left in
(1.1); therefore by the fundamental theorem of calculus,


$$\int_c^d f(\varphi(x))\,\frac{d\varphi}{dx}\,dx = g(\varphi(d)) - g(\varphi(c)), \tag{1.4}$$


where c and d are the endpoints of the interval S. Since $\varphi(c) = a$ and $\varphi(d) = b$,
the right sides of (1.3) and (1.4) are the same; this completes the proof of (1.1).
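
As a numerical illustration of (1.1), the following minimal sketch compares both sides for one concrete case. The choices $\varphi(x) = x^2 + x$ on S = [0, 1] and f = cos, as well as the helper name `midpoint_integral`, are assumptions made for this example only; they do not come from the text.

```python
import math

def midpoint_integral(func, lo, hi, n=20000):
    """Approximate the integral of func over [lo, hi] by the midpoint rule."""
    h = (hi - lo) / n
    return h * sum(func(lo + (k + 0.5) * h) for k in range(n))

# Illustrative choices (assumptions, not from the text).
def phi(x):
    return x * x + x           # maps S = [0, 1] onto T = [0, 2]

def dphi(x):
    return 2.0 * x + 1.0       # derivative of phi

f = math.cos                    # continuous integrand in y

c, d = 0.0, 1.0
a, b = phi(c), phi(d)

lhs = midpoint_integral(lambda x: f(phi(x)) * dphi(x), c, d)  # left side of (1.1)
rhs = midpoint_integral(f, a, b)                               # right side of (1.1)
exact = math.sin(b) - math.sin(a)                              # g(b) - g(a) with g = sin
```

Here `exact` plays the role of g(b) − g(a) in (1.3), with g = sin as the anti-derivative of f = cos.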

The usual proof of the change of variable formula in several dimensions uses 
the approximation of integrals by finite sums; see for instance [7]. The purpose of 
this note is to show how to use the fundamental theorem of calculus to prove the 
change of variable formula for functions of any number of variables. Then, as a 
surprising byproduct, we obtain a proof of the Brouwer fixed point theorem. In the 
last section we compare our proof with other known analytic proofs of the fixed 
point theorem. 

I thank Daniel Velleman for suggesting a substantial simplification of the
argument.


2. In this section we study mappings $\varphi(x) = y$ of n-dimensional x space into
n-dimensional y space. We impose two assumptions:


i) $\varphi$ is once differentiable.
ii) $\varphi$ is the identity outside some sphere, say the unit sphere:


$$\varphi(x) = x \quad \text{for } |x| \ge 1.$$
Change of variable theorem. Let f be a continuous function of compact support.
Then


$$\int f(\varphi(x))\,J(x)\,dx = \int f(y)\,dy, \tag{2.1}$$


where J is the Jacobian determinant of the mapping $\varphi$:


$$J(x) = \det\left(\frac{\partial \varphi_j}{\partial x_i}\right); \tag{2.2}$$


here $\varphi_j$ is the $j$th component of $\varphi$.


We prove this for functions f that are once differentiable and for mappings $\varphi$
that are twice differentiable; since functions and mappings can be approximated by
differentiable ones, this suffices. The approximation can be accomplished by
mollification, that is, by convolving each component of $\varphi$ with a smooth, nonnegative,
spherically symmetric function m with small support whose integral equals 1.
As the support of m shrinks to zero, $m * \varphi$ and its first derivatives tend to those of
$\varphi$. If $\varphi$ is the identity, so is $m * \varphi$.


Define


$$g(y_1, y_2, \ldots, y_n) = \int_{-\infty}^{y_1} f(z, y_2, \ldots, y_n)\,dz. \tag{2.3}$$


Clearly, $\partial g/\partial y_1 = f$. Since f is once differentiable, so is g. Since f has compact
support, we can choose c so large that f is zero outside the c-cube

$$|y_j| \le c, \qquad j = 1, 2, \ldots, n.$$

It follows from (2.3) that $g(y_1, \ldots, y_n) = 0$ when $|y_j| \ge c$ for any $j \ne 1$, and when
$y_1 \le -c$. Take c > 1; then the c-cube contains the unit ball. Since $\varphi$ is the identity
outside the unit ball, $f(\varphi(x))$ is zero outside the c-cube in x-space. So in the
integrals in (2.1) we may restrict integration to the c-cube.
In the left side of (2.1), express f as the partial derivative of g:


$$\int (\partial_{y_1} g)(\varphi(x))\,J(x)\,dx. \tag{2.4}$$


We denote by D the gradient with respect to x; the columns of the Jacobian
matrix $\partial\varphi/\partial x$ are $D\varphi_1, \ldots, D\varphi_n$.


Observation. The integrand in (2.4) can be written as the following determinant:


$$\det\bigl(Dg(\varphi),\, D\varphi_2, \ldots, D\varphi_n\bigr). \tag{2.5}$$


Proof: By the chain rule


$$Dg(\varphi) = \sum_{j=1}^{n} (\partial_{y_j} g)(\varphi)\,D\varphi_j. \tag{2.6}$$

We set this into the first column in (2.5). Formula (2.6) expresses $Dg(\varphi)$ as a linear
combination of the vectors $D\varphi_1, D\varphi_2, \ldots, D\varphi_n$; the last $n - 1$ of these vectors are
the last $n - 1$ columns of the matrix in (2.5), and therefore their multiples can be subtracted
from $Dg(\varphi)$ without altering the value of the determinant (2.5). This leaves us with
$\det\bigl((\partial_{y_1} g)(\varphi)\,D\varphi_1, D\varphi_2, \ldots, D\varphi_n\bigr)$; factoring out the scalar $(\partial_{y_1} g)(\varphi)$ gives
$(\partial_{y_1} g)(\varphi)\,J$, the integrand in (2.4). ∎


The next step is to expand the determinant (2.5) according to the first column;
we obtain


$$M_1\,\partial_{x_1} g(\varphi) + \cdots + M_n\,\partial_{x_n} g(\varphi), \tag{2.7}$$

where $M_1, \ldots, M_n$ are the cofactors of the first column of the Jacobian matrix.
Setting (2.7) into the integrand in (2.4) we get

$$\int \bigl(M_1\,\partial_{x_1} g(\varphi) + \cdots + M_n\,\partial_{x_n} g(\varphi)\bigr)\,dx. \tag{2.8}$$


Since $\varphi$ is twice differentiable, we can integrate each term by parts over the
c-cube and obtain


$$-\int g(\varphi)\,\bigl(\partial_{x_1} M_1 + \cdots + \partial_{x_n} M_n\bigr)\,dx + \text{boundary terms}. \tag{2.9}$$


We use now the following classical identity:

$$\partial_{x_1} M_1 + \cdots + \partial_{x_n} M_n = 0. \tag{2.10}$$

We sketch a proof: We can write the left side of (2.10) symbolically as

$$\det(D, D\varphi_2, \ldots, D\varphi_n). \tag{2.11}$$

For n = 2 we have

$$\det(D, D\varphi_2) = \partial_{x_1}\partial_{x_2}\varphi_2 - \partial_{x_2}\partial_{x_1}\varphi_2 = 0.$$

For n > 2 we note that the cofactors $M_j$ are multilinear functions of the $\varphi_k$. Using
the product rule of differentiation, we write (2.11), again symbolically, as

$$\sum_{2 \le k \le n} \det(D, D\varphi_2, \ldots, D\varphi_n)_k, \tag{2.12}$$


where the subscript k means that the differential operator D in the first column
acts only on the kth column. We leave it to the reader to verify that each of the
determinants in the sum (2.12) is zero.
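
For n = 3 the verification can be written out with the cross product. This is a worked special case added for illustration, not the argument of the text; it relies on the standard vector identities for the divergence of a cross product and the vanishing curl of a gradient, and it assumes, as the text does, that the mapping is twice differentiable.

```latex
% n = 3: the cofactors of the first column form the cross product
% of the remaining two columns of the Jacobian matrix.
\begin{align*}
(M_1, M_2, M_3) &= D\varphi_2 \times D\varphi_3, \\
\partial_{x_1} M_1 + \partial_{x_2} M_2 + \partial_{x_3} M_3
  &= D \cdot (D\varphi_2 \times D\varphi_3) \\
  &= D\varphi_3 \cdot (D \times D\varphi_2) - D\varphi_2 \cdot (D \times D\varphi_3) \\
  &= 0,
\end{align*}
% since D \times D\varphi_k is the curl of a gradient, which vanishes
% because the mixed second partial derivatives of \varphi_k are equal.
```

The two terms in the third line are the k = 2 and k = 3 determinants of (2.12); each vanishes separately.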

The identity (2.10) shows that the n-fold integral in (2.9) is zero. 

We turn now to the boundary term in (2.9). Since $g(\varphi(x)) = g(x)$ on the
boundary of the c-cube, the only nonzero boundary term is from the side $x_1 = c$;
since $M_1 = 1$ when $\varphi(x) = x$, that boundary term is


$$\int g(c, x_2, \ldots, x_n)\,dx_2 \cdots dx_n. \tag{2.13}$$


Using the definition (2.3) of g in (2.13) gives


$$\int\!\!\int f(z, x_2, \ldots, x_n)\,dz\,dx_2 \cdots dx_n,$$


which is the right side of equation (2.1). This completes the proof of the change of
variables formula.
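
The formula (2.1) holds with the signed Jacobian even when the mapping folds space over itself. The sketch below checks this numerically in two dimensions. Everything specific in it is an assumption chosen for illustration: the bump-shift map `phi`, the amplitude `A`, the test function `f`, and the grid size are not taken from the text.

```python
import math

A = 1.5  # bump amplitude; large enough that phi folds (J < 0 somewhere)

def bump(x1, x2):
    """Twice differentiable bump supported in the unit disk: (1 - r^2)^3 for r < 1."""
    r2 = x1 * x1 + x2 * x2
    return (1.0 - r2) ** 3 if r2 < 1.0 else 0.0

def phi(x1, x2):
    """Shift x1 by A * bump; the map is the identity for |x| >= 1."""
    return x1 + A * bump(x1, x2), x2

def jacobian(x1, x2):
    """J = d(phi_1)/d(x1); phi_2 = x2 makes the Jacobian matrix triangular."""
    r2 = x1 * x1 + x2 * x2
    if r2 >= 1.0:
        return 1.0
    return 1.0 - 6.0 * A * x1 * (1.0 - r2) ** 2

CENTER, RHO = (0.2, 0.1), 0.5

def f(y1, y2):
    """Once differentiable function supported in a disk of radius RHO."""
    s = ((y1 - CENTER[0]) ** 2 + (y2 - CENTER[1]) ** 2) / RHO ** 2
    return (1.0 - s) ** 2 if s < 1.0 else 0.0

def lhs_integral(n=400, half_width=2.0):
    """Midpoint-rule approximation of the left side of (2.1) over a square."""
    h = 2.0 * half_width / n
    total = 0.0
    for i in range(n):
        x1 = -half_width + (i + 0.5) * h
        for j in range(n):
            x2 = -half_width + (j + 0.5) * h
            total += f(*phi(x1, x2)) * jacobian(x1, x2)
    return total * h * h

exact_rhs = math.pi * RHO ** 2 / 3.0   # integral of f, computed in polar coordinates
lhs = lhs_integral()
min_jacobian = min(jacobian(0.01 * k, 0.0) for k in range(100))
```

The value `min_jacobian` is negative, so this `phi` is not one-to-one, yet `lhs` agrees with `exact_rhs` up to discretization error.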


3. In our proof of the change of variables formula, we assumed neither that $\varphi$ is
one-to-one, nor that it is onto. We claim:

A mapping $\varphi$ having properties i) and ii) of the change of variables theorem maps
$\mathbb{R}^n$ onto $\mathbb{R}^n$.
Suppose some point $y_0$ were not the image of any x. Since $\varphi$ is the identity
outside the unit ball, $y_0$ would lie inside the unit ball. Since $\varphi$ maps $|x| \le 1$ into a
closed set, it would follow that some ball $B_0$ centered at $y_0$ would be free of
images of $\varphi$. Now take any function f supported in the ball $B_0$, whose integral is
nonzero:


$$\int f\,dy \ne 0. \tag{3.1}$$


By the change of variable formula


$$\int f(\varphi(x))\,J\,dx = \int f\,dy \ne 0. \tag{3.2}$$


Since the range of $\varphi$ avoids $B_0$, and since the support of f lies in $B_0$, the
integrand on the left in (3.2) is identically zero; then so is the integral. This
contradicts (3.1), and so the claim is established.


Intermediate Value Theorem. Let $\varphi$ be a continuous map of the unit ball in $\mathbb{R}^n$ into
$\mathbb{R}^n$ that is the identity on the boundary:


$$\varphi(x) = x \quad \text{for } |x| = 1.$$


Then the image of $\varphi$ covers every point in the unit ball.


Proof: Extend $\varphi$ to be the identity outside the unit ball. Then approximate the
extended map by differentiable maps, each the identity outside the unit ball.
According to our claim, each of these maps covers the unit ball. By compactness,
so does their limit. ∎


The following well-known argument shows how to deduce the Brouwer fixed 
point theorem from the intermediate value theorem. 

Let $\psi$ be a continuous mapping of the unit ball into the unit ball; we claim that
it leaves a point fixed. If not then for every x there is a ray from $\psi(x)$ through x.
This ray pierces the unit ball at a point that we denote by $\varphi(x)$. Clearly, $\varphi$ is a
continuous mapping; it is the identity for x on the unit sphere and maps the unit
ball into the unit sphere. This contradicts the intermediate value theorem. ∎
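
The point where the ray leaves the ball can be computed by solving a quadratic equation. The following is a minimal sketch of that construction; the function name `retract_to_sphere` and the sample points are hypothetical, introduced only for illustration.

```python
import math

def retract_to_sphere(x, psi_x):
    """Point where the ray from psi_x through x leaves the unit ball.

    Solves |x + t*d|^2 = 1 for the root t >= 0, with d = x - psi_x.
    Requires psi_x != x, i.e. x must not be a fixed point.
    """
    d = [xi - pi for xi, pi in zip(x, psi_x)]
    dd = sum(di * di for di in d)
    if dd == 0.0:
        raise ValueError("x is a fixed point; the ray is undefined")
    xd = sum(xi * di for xi, di in zip(x, d))
    xx = sum(xi * xi for xi in x)
    # Quadratic dd*t^2 + 2*xd*t + (xx - 1) = 0; take the nonnegative root.
    t = (-xd + math.sqrt(xd * xd + dd * (1.0 - xx))) / dd
    return [xi + t * di for xi, di in zip(x, d)]

# Sample points (assumptions): x inside the ball, psi_x its supposed image.
inside = retract_to_sphere([0.3, 0.4], [-0.1, 0.2])
on_boundary = retract_to_sphere([1.0, 0.0], [0.2, 0.5])
```

For x on the unit sphere the nonnegative root is t = 0, which is why the construction restricts to the identity there.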


4. The Brouwer fixed point theorem has many analytical proofs. How do they 
compare with the present one? Hadamard [3] employed the identity (2.10) about 
the Jacobian matrix; so did Dunford-Schwartz [2, pp. 467-470]. 

Samelson [6] used Stokes’ theorem to give an extremely short proof of the 
Brouwer fixed point theorem. This proof was rediscovered by Kannai [5]. Accord- 
ing to Laurent Schwartz, as related by Haim Brézis, such a proof was current in 
Paris in the thirties. 

Baez-Duarte [1] proved formula (2.1) using exterior forms and Stokes’ theorem 
and deduced from it the intermediate value theorem. My deduction is the same as 
Baez-Duarte’s. 

The integration of exterior forms over chains presupposes the change of 
variable formula for multiple integrals. It is amusing that the change of variables 
formula alone implies Brouwer’s theorem. 

In conclusion we call attention to Erhardt Heinz’s beautiful analytic treatment 
of the Brouwer degree of a mapping. 
Random Walks and Plane Arrangements
in Three Dimensions


Louis J. Billera, Kenneth S. Brown, and Persi Diaconis 


1. INTRODUCTION. The geometry of hyperplane arrangements in Euclidean 
space is a rich subject, which touches geometry [14], combinatorics [21], and 
operations research [23]. Probability was introduced into the subject by Bidigare, 
Hanlon, and Rockmore [2], who found a natural family of random walks associated 
with hyperplane arrangements. These walks were studied further by Brown and 
Diaconis [6]. One reason this development is exciting is that the walks admit a 
rather complete theory. We introduce the reader to this circle of ideas by 
specializing to the 3-dimensional case (planes in $\mathbb{R}^3$). Here we are able to use tools
from geometry to obtain a surprising formula for the stationary distribution of 
the walk. 


1.1. The geometric setup. Consider a collection $\mathcal{A}$ of n planes through the origin
in $\mathbb{R}^3$. We assume throughout this paper that the intersection of the planes is {0}.
In particular, $n \ge 3$. It is useful to picture the arrangement via the intersection
of the planes with the unit sphere $S^2$. See Figure 1 for the case where $\mathcal{A}$ consists
of the three coordinate planes. The picture shows the Northern Hemisphere,
viewed from above the North Pole; thus the outer circle is the equator z = 0.
Adding the plane z = x + y gives the picture in Figure 2.


Figure 1. Three great circles on $S^2$.


Figure 2. Four great circles on $S^2$.
The n great circles corresponding to the arrangement $\mathcal{A}$ decompose the sphere
into cells: There are $f_0$ vertices, $f_1$ edges, and $f_2$ regions. When n = 3, for
example, one has $f_0 = 6$, $f_1 = 12$, and $f_2 = 8$. Only the cells in the Northern
Hemisphere are visible in the picture, but the Southern Hemisphere has exactly
the same geometry. Notice that all the regions are triangular in Figure 1. In Figure
2 there are $f_2 = 14$ 2-dimensional cells: 8 triangles and 6 quadrilaterals, half of
which are visible.

The cell decomposition induced by an arrangement has the following special
property: Given a region C and a vertex v, there is a unique region C' adjacent to v
that is closest to C, in the sense that C' is separated from C by the minimum number of
great circles. The region C' is said to be the projection of C on v and is denoted vC.
We explain in Section 3.2 why it is uniquely defined. Figure 3 shows an example;
here C' is at distance two from C.


Figure 3. The projection of C on v.


1.2. A random walk on the regions. Bidigare, Hanlon, and Rockmore [2] used the
projection operators to define the following walk on the regions: If the walk is in
region C, choose a vertex v at random and move to the projection C' = vC. (We
assume here that the vertices are chosen uniformly, so that all $f_0$ vertices are
equally likely.) This walk is described mathematically by its transition matrix K. The
rows and columns of K are indexed by the regions, with K(C, C') being the chance
of moving from C to C' in one step. Thus


$$K(C, C') = \frac{1}{f_0}\,A(C, C'),$$


where A(C, C') is the number of vertices v of C' such that vC = C'.

Suppose, for instance, that $\mathcal{A}$ consists of the three coordinate planes. The 8
triangular regions are the intersections with the sphere of the 8 orthants in $\mathbb{R}^3$, as
indicated in Figure 4. For example, the region + - + corresponds to the orthant
x > 0, y < 0, z > 0. The matrix A is shown in Table 1; one has $K = \tfrac{1}{6}A$.


Figure 4. The regions correspond to orthants. 
TABLE 1 The matrix A.


        +++   ++-   +-+   +--   -++   -+-   --+   ---
+++      3     1     1     0     1     0     0     0
++-      1     3     0     1     0     1     0     0
+-+      1     0     3     1     0     0     1     0
+--      0     1     1     3     0     0     0     1
-++      1     0     0     0     3     1     1     0
-+-      0     1     0     0     1     3     0     1
--+      0     0     1     0     1     0     3     1
---      0     0     0     1     0     1     1     3


Notice that for any region C, the three vertices of C satisfy vC = C; this 
explains the diagonal entries of A. Projection onto each of the remaining three 
vertices flips C to an adjacent region, thus accounting for the three 1’s in each row. 
At each step, then, the walk stays in its current region with probability 1/2 and 
otherwise moves to a randomly chosen adjacent region. This example is unusual, in 
that one never moves a distance of more than 1 in any step; typically the walk is 
much more vigorous. 
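
The matrix A for the three coordinate planes can be rebuilt mechanically from sign vectors. The sketch below is one way to do it, assuming the sign-vector product that the paper describes in its Section 3.2 (a zero entry of the vertex is filled in from the region); the helper name `face_product` and the string encoding are choices made for this example.

```python
from itertools import product

SIGNS = "+-"

def face_product(face, chamber):
    """Sign-vector product: keep the face's sign unless it is '0'."""
    return "".join(c if f == "0" else f for f, c in zip(face, chamber))

# Chambers: the 8 orthants, in the same order as the columns of Table 1.
chambers = ["".join(s) for s in product(SIGNS, repeat=3)]

# Vertices: the 6 points +-e_i on the sphere; exactly one nonzero sign each.
vertices = []
for axis in range(3):
    for sign in SIGNS:
        v = ["0", "0", "0"]
        v[axis] = sign
        vertices.append("".join(v))

# A[C][C'] = number of vertices v with vC = C'.
A = {c: {c2: 0 for c2 in chambers} for c in chambers}
for c in chambers:
    for v in vertices:
        A[c][face_product(v, c)] += 1

rows = [[A[c][c2] for c2 in chambers] for c in chambers]
```

Each entry of `rows` divided by 6 gives the corresponding entry of K for this arrangement.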

The lth power of the matrix K gives the transition probabilities after l steps; for
example,


$$K^2(C, C') = \sum_{C''} K(C, C'')\,K(C'', C'),$$


which is the chance of moving from C to C' in two steps. After all, to get from C
to C' in two steps the walk must go to some C'' and then to C'.

A fundamental theorem of Markov chain theory [17, Theorem 4.1.4] implies
that $K^l(C, C')$ tends to a limit $\pi(C')$, independent of the starting region C:


$$K^l(C, C') \to \pi(C') \quad \text{as } l \to \infty.$$


(The theorem requires a mild regularity condition, which is satisfied by our chain.)
Here $\pi$ is a probability distribution on the set of regions, and $\pi(C')$ represents the
chance that the random walk is in C' after a large number of steps from any
starting region C. The distribution $\pi$ is called the stationary distribution of the
walk. It can be characterized as the unique probability distribution satisfying


$$\sum_{C} \pi(C)\,K(C, C') = \pi(C') \tag{1}$$


for all C'. This says that $\pi$, viewed as a row vector, is a left eigenvector for K with
eigenvalue 1.


1.3. Analysis of the walk. Bidigare, Hanlon, and Rockmore [2] determined the
eigenvalues of K, which turn out to be real (and even rational):


(a) 1 is an eigenvalue of multiplicity 1.
(b) For each plane $H \in \mathcal{A}$, there is an eigenvalue

$$\lambda_H = \frac{\#\text{ of vertices on } H \cap S^2}{f_0}$$

of multiplicity 1.
(c) $2/f_0$ is an eigenvalue of multiplicity $(f_2 - 2)/2$.
(d) 0 is an eigenvalue of multiplicity $(f_2 - 2)/2 - n + 1$.

A different proof of this can be found in [6], where it is shown further that K is
diagonalizable. Yet another proof is given in [5]. For a simple example, take n = 3
again; the 8 eigenvalues are then $1, \tfrac23, \tfrac23, \tfrac23, \tfrac13, \tfrac13, \tfrac13, 0$.
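
For the three coordinate planes the eigenvalue list can be confirmed exactly. The sketch below uses the products of coordinate signs as candidate eigenvectors; this is a standard device for walks on the vertices of a cube and is an assumption of this example, not the method of [2], [5], or [6].

```python
from fractions import Fraction
from itertools import product, combinations

chambers = list(product((1, -1), repeat=3))   # sign vectors of the 8 orthants

def K(c, c2):
    """Transition probability for the three coordinate planes."""
    flips = sum(a != b for a, b in zip(c, c2))
    if flips == 0:
        return Fraction(1, 2)
    if flips == 1:
        return Fraction(1, 6)
    return Fraction(0)

def chi(subset, c):
    """Product of the signs of c over the coordinates in subset."""
    value = 1
    for i in subset:
        value *= c[i]
    return value

eigenvalues = []
for size in range(4):
    for subset in combinations(range(3), size):
        lam = Fraction(3 - size, 3)
        # Check (K chi)(C) = lam * chi(C) for every chamber C.
        for c in chambers:
            k_chi = sum(K(c, c2) * chi(subset, c2) for c2 in chambers)
            assert k_chi == lam * chi(subset, c)
        eigenvalues.append(lam)
eigenvalues.sort(reverse=True)
```

The eight sign-product functions are linearly independent, so the sorted list `eigenvalues` is the full spectrum of K in this example.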

A second result of [2, 6] is a surprisingly simple estimate relating the eigenvalues
to the rate of convergence to stationarity. Let $K_C^l$ be the distribution of the walk
started from C after l steps, i.e., $K_C^l(C') = K^l(C, C')$. We measure convergence
rate by means of the distance $\|K_C^l - \pi\|$ defined as follows: If P and Q are
probability distributions on a finite set X, then

$$\|P - Q\| = \max_{A \subseteq X} |P(A) - Q(A)|. \tag{2}$$

The version of the convergence rate estimate given in [6] is


$$\|K_C^l - \pi\| \le \sum_{H \in \mathcal{A}} \lambda_H^l \tag{3}$$


for any starting region C. When n = 3, for example, the distance to stationarity is
bounded by $3\left(\tfrac23\right)^l$ after l steps.
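
To see the estimate (3) at work for n = 3, the sketch below iterates the walk exactly and compares the distance (2) with the bound. It assumes the transition probabilities of the coordinate-plane example (1/2 to stay, 1/6 to each adjacent region) and uses the standard fact that the maximum in (2) equals half the sum of absolute differences; the uniform distribution is used as the stationary distribution because K is symmetric here.

```python
from fractions import Fraction
from itertools import product

chambers = list(product((1, -1), repeat=3))

def K(c, c2):
    """Stay with probability 1/2, move to each adjacent region with 1/6."""
    flips = sum(a != b for a, b in zip(c, c2))
    return {0: Fraction(1, 2), 1: Fraction(1, 6)}.get(flips, Fraction(0))

def step(dist):
    """One step of the walk; dist is a row vector indexed by chambers."""
    return {c2: sum(dist[c] * K(c, c2) for c in chambers) for c2 in chambers}

def tv_distance(p, q):
    """max_A |P(A) - Q(A)|, computed as half the L1 distance."""
    return sum(abs(p[c] - q[c]) for c in chambers) / 2

pi = {c: Fraction(1, 8) for c in chambers}
start = chambers[0]
dist = {c: Fraction(int(c == start)) for c in chambers}

distances, bounds = [], []
for l in range(1, 9):
    dist = step(dist)
    distances.append(tv_distance(dist, pi))
    bounds.append(3 * Fraction(2, 3) ** l)
```

The lists `distances` and `bounds` hold the left and right sides of (3) for l = 1, ..., 8.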

Finally, a method is given in [6] for computing the stationary distribution $\pi$,
though it is not always easy to get a useful formula for $\pi$ by this method. Our main
result is an explicit formula for $\pi$ in the 3-dimensional case:


Theorem 1. Let $\mathcal{A}$ be an arrangement of n planes in $\mathbb{R}^3$ whose intersection is {0}. Let
$\pi$ be the stationary distribution of the random walk on regions of the sphere, as
described above. If C is a region with i sides, then


$$\pi(C) = \frac{i - 2}{2(f_0 - 2)}.$$


We find this result quite surprising. It is not even obvious to us why all i-gons
should have the same stationary probability, let alone why this probability should
be proportional to $i - 2$. In our n = 3 example, the theorem says that the
stationary distribution is uniform: $\pi(C) = \tfrac18$ for each of the 8 triangles C.


1.4. Organization of the paper. In Section 2 we make several remarks about the 
main theorem and illustrate it with examples. In Section 3 we explain how to 
describe vertices, edges, and regions by means of sign sequences; this description is 
used in the proof of our main theorem. Section 4 gives some background on 
hyperplane arrangements in order to put the results stated in Section 1.3 into a 
broader context. We also state in Section 4 some of the results of [2, 6] that were 
specialized to the 3-dimensional case in Section 1.3. In Section 5 we show how the 
general theory specializes to card shuffling and random tiling models. We work out 
some 3-dimensional cases of this in detail. Sections 4 and 5 are included mainly for 
motivation; they are not needed for the proof of Theorem 1. The proof of the 
latter begins in Section 6, where we give a geometric description of the matrix K. 
With this description available, it is quite easy to complete the proof; we do this in 
Section 7. Finally, Section 8 contains some pointers to the literature on plane
arrangements in $\mathbb{R}^3$.

The key arguments of this paper represent the combinatorial essence of a 
hyperplane arrangement as a collection of sequences of signs; see Section 3. This 
can be abstracted to the notion of an oriented matroid. The present project makes 
a nice introduction to these ideas. 
2. EXAMPLES AND REMARKS


2.1. General position. The n planes are said to be in general position if no three of
the planes have a nonzero intersection or, equivalently, if only two great circles
pass through each vertex on the sphere. Many of the examples in this paper are
in fact in general position; but the simplicial arrangements to be discussed in
Section 2.3 are never in general position unless n = 3, nor are the arrangements in
Section 5 below.
It is easy to count cells in the general position case. One finds

$$f_0 = n(n - 1), \qquad f_1 = 2n(n - 1), \qquad \text{and} \qquad f_2 = n(n - 1) + 2.$$

In fact, there are $\binom{n}{2}$ pairs of great circles, each determining a pair of antipodal
vertices, whence the first equation. For the second equation, note that each of the
n great circles is cut into $2(n - 1)$ arcs by the other $n - 1$ great circles. [Alternatively,
count the vertex-edge pairs in two different ways to get $2f_1 = 4f_0$, so
that $f_1 = 2f_0$.] Finally, the third equation can be proved by a straightforward
inductive argument, or it can be deduced from the first two via Euler's relation
$f_0 - f_1 + f_2 = 2$.


The list of eigenvalues $\lambda$ and multiplicities $m_\lambda$ given in Section 1.3 becomes

$$\begin{array}{c|cccc}
\lambda & 1 & \dfrac{2}{n} & \dfrac{2}{n(n-1)} & 0 \\[6pt]
\hline
m_\lambda & 1 & n & \dbinom{n}{2} & \dbinom{n-1}{2}
\end{array}$$
And the convergence rate estimate (3) is


$$\|K_C^l - \pi\| \le n\left(\frac{2}{n}\right)^l.$$


This shows that the distance to the stationary distribution is small after l = 2 steps
if n is large. In Section 6.3, after we have a geometric description of the transition
matrix K, we describe a family $\mathcal{A}_n$ of arrangements of n planes in general position
such that $\|K_C^1 - \pi\| \ge c > 0$ for all n. In this sense the walk is close to stationary
after 2 steps, but not after 1 in general.
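
The general position counts are easy to tabulate. The sketch below encodes the cell counts, the eigenvalue multiplicities obtained by substituting those counts into items (a) to (d) of Section 1.3, and the right side of the convergence estimate; the function names are hypothetical and chosen for this example.

```python
from fractions import Fraction
from math import comb

def cell_counts(n):
    """(f0, f1, f2) for n planes through the origin in general position."""
    return n * (n - 1), 2 * n * (n - 1), n * (n - 1) + 2

def spectrum(n):
    """Eigenvalue -> multiplicity, from items (a)-(d) of Section 1.3."""
    f0, _, f2 = cell_counts(n)
    return {
        Fraction(1): 1,
        Fraction(2 * (n - 1), f0): n,        # one per plane; 2(n-1) vertices on each
        Fraction(2, f0): (f2 - 2) // 2,
        Fraction(0): (f2 - 2) // 2 - n + 1,
    }

def convergence_bound(n, l):
    """Right side of estimate (3) in general position: n * (2/n)^l."""
    return n * Fraction(2, n) ** l
```

For instance, `convergence_bound(n, 2)` equals 4/n, which is the sense in which two steps suffice when n is large.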

Another feature of the general position case is that if one wants to carry out the
random walk algorithmically, it is quite easy to pick a random vertex: First pick a
pair of great circles at random, so that all $\binom{n}{2}$ pairs are equally likely; this defines a pair
of antipodal points. Now choose one of these two points with probability $\tfrac12$.
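
The vertex-picking recipe can be sketched as follows, assuming each plane is given by a normal vector, so that the two antipodal points of a pair of great circles lie along the cross product of the two normals. The function name `random_vertex` and the representation by normals are assumptions of this example.

```python
import math
import random

def cross(a, b):
    """Cross product of two 3-vectors."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def random_vertex(normals, rng=random):
    """Pick a uniformly random vertex of a general position arrangement.

    normals: one normal vector per plane through the origin.
    """
    n1, n2 = rng.sample(normals, 2)       # uniform over pairs of planes
    direction = cross(n1, n2)             # spans the line where the two planes meet
    length = math.sqrt(sum(x * x for x in direction))
    sign = rng.choice((1.0, -1.0))        # one of the two antipodal points
    return tuple(sign * x / length for x in direction)

# Normals of the four planes x = 0, y = 0, z = 0, z = x + y (Figure 2).
normals = [(1, 0, 0), (0, 1, 0), (0, 0, 1), (1, 1, -1)]
v = random_vertex(normals, random.Random(0))
```

In general position every vertex lies on exactly two planes, so each of the n(n − 1) vertices is returned with the same probability.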
Finally, Theorem 1 gives the following formula for the stationary distribution in
the general position case:

$$\pi(C) = \frac{i - 2}{2(n(n - 1) - 2)}$$

if C is an i-gon.
Here are some specific examples: 


Example 1. For n = 3, the arrangement is combinatorially equivalent to that of
Figure 1. There are 8 triangular regions, and the stationary distribution is $\pi(C) = \tfrac18$
for all C.
Example 2. For n = 4 planes in general position (Figure 2) there are 14 regions: 8
triangles with stationary probability $\pi(C) = \tfrac{1}{20}$ and 6 quadrilaterals with
$\pi(C) = \tfrac{2}{20}$.

Example 3. With n = 5 planes in general position, the situation is as in Figure 5
up to isomorphism. There are 10 triangles with $\pi(C) = \tfrac{1}{36}$, 10 quadrilaterals with
$\pi(C) = \tfrac{2}{36}$, and 2 pentagons with $\pi(C) = \tfrac{3}{36}$.


Figure 5. Five great circles in general position. 
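
The probabilities in Examples 1 to 3 follow directly from Theorem 1 and can be checked to sum to 1. A minimal sketch, using the region counts quoted in the examples and $f_0 = n(n - 1)$; the function names are hypothetical.

```python
from fractions import Fraction

def stationary_probability(sides, f0):
    """Theorem 1: pi(C) = (i - 2) / (2 (f0 - 2)) for a region with i sides."""
    return Fraction(sides - 2, 2 * (f0 - 2))

def total_mass(region_counts, f0):
    """Sum of pi over all regions; region_counts maps sides -> number of regions."""
    return sum(count * stationary_probability(sides, f0)
               for sides, count in region_counts.items())

# Region counts quoted in Examples 1-3, keyed by the number of planes n.
examples = {
    3: {3: 8},
    4: {3: 8, 4: 6},
    5: {3: 10, 4: 10, 5: 2},
}
masses = {n: total_mass(counts, n * (n - 1)) for n, counts in examples.items()}
```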


Example 4. With n = 6 or more planes the number of regions of each type can
vary with the actual arrangement. From the catalogue in Grünbaum [12, 14] one
sees that up to combinatorial equivalence there are exactly four general position
arrangements of 6 planes, as shown in Figure 6. The number of i-gons (i = 3, 4, 5, 6)
for each of these four arrangements is shown in Table 2. Each arrangement has
$n(n - 1) + 2 = 32$ regions. For each arrangement, the stationary probability of an
i-gon is $\tfrac{i - 2}{56}$. Note that, for some values of i, some arrangements don't have any
i-gons. Nonetheless, the stationary probabilities sum to 1.


Figure 6. Six great circles in general position.


TABLE 2 The number of i-gons. 
2.2. The Euler relation. For a given arrangement of n planes, let $p_i$ be the
number of i-sided regions. Since the stationary probabilities in Theorem 1 must
sum to 1, we get

$$\sum_i \frac{i - 2}{2(f_0 - 2)}\,p_i = 1. \tag{4}$$

We now show that this relation is equivalent to the Euler relation $f_0 - f_1 + f_2 = 2$.
Rewrite (4) as $\sum i\,p_i - 2\sum p_i = 2(f_0 - 2)$. The first sum is the number of pairs
consisting of a region and an edge of that region; since each edge is on exactly two
regions, this sum equals $2f_1$. The second sum is $f_2$, so (4) becomes
$2f_1 - 2f_2 = 2(f_0 - 2)$, which is Euler's identity.


2.3. Simplicial arrangements. The arrangement $\mathcal{A}$ is said to be simplicial if every
region on the sphere is a triangle. The simplicial case has long held a special 
fascination for geometers [13, 14]. And it is of special interest for the probability 
story also. In fact, every arrangement has some triangles (at least n of them by 
Levi’s theorem [14, p. 25]); so Theorem 1 has the following consequence: 


Proposition 1. The random walk on regions has a uniform stationary distribution if 
and only if the arrangement is simplicial. 


There are three known infinite families of simplicial arrangements and 90 
“sporadic” examples. It has been conjectured that there are no more infinite 
families and at most finitely many additional sporadic examples; see [14, p. 8, 
Conjecture 2.1]. A catalogue showing the first 89 sporadic examples can be found 
in [13], and the 90th is in [14]. (Note: It is stated in [14] that there are 91 sporadic 
examples, but the arrangements called A,(17) and A,(17) in [13] have been shown 
to be isomorphic; see [15, p. 59].) Many of the examples have a great deal of 
symmetry, and one is therefore not surprised when the stationary distribution turns 
out to be uniform. But there is one known example ([14, pp. 8-9]) of a simplicial
arrangement whose group of combinatorial symmetries is trivial. Nevertheless, the
walk on the regions has a uniform stationary distribution. This example, which has
n = 28, is shown in Figure 7 in its projective representation. We discuss such
representations in Section 8; for now, suffice it to say that we get the spherical
representation by drawing the lines in Figure 7 as great semicircles on the
Northern Hemisphere. Parallel lines yield great semicircles with the same endpoints
on the equator.


Figure 7. A simplicial arrangement with no symmetry (projective picture); the line at infinity is
included.


2.4. Arrangements of pseudocircles. The theory developed in [6] and in the 
present paper goes through if the family of great circles is replaced by a family of 
“pseudocircles,” i.e., simple closed curves on the sphere that are not necessarily 
great circles but merely intersect as great circles would. See [4, 14, 23] for the 
precise definition and many examples. The reason the theory goes through is that, 
as we remarked in the introduction, all of our work is based on the oriented 
matroid underlying the arrangement, and this exists for arrangements of pseudo- 
circles also. Generalizing in this way is of interest because it vastly increases the 
supply of examples. In particular, one obtains seven new infinite families of 
simplicial arrangements; see [14, 11]. 


3. AN ENCODING OF THE CELLS. In Figure 4 we labeled each region on the
sphere by a vector $(\sigma_1, \sigma_2, \sigma_3)$ of signs +, -. In this section we give a similar
encoding of the regions, as well as the vertices and edges, for an arbitrary
arrangement. This encoding is used in the proof of our main theorem in Section 7.
The sign vectors are easier to think about if we replace each cell e by the cone
over e, i.e., by the set of positive scalar multiples of e. We begin by establishing the
language for talking about these cones.


3.1. Chambers. The open regions into which $\mathbb{R}^3$ is cut by $\mathcal{A}$ are called chambers.
For example, if $\mathcal{A}$ consists of the three coordinate hyperplanes, the chambers are
the 8 open orthants. The regions on the sphere that we have been discussing are
simply the intersections of the chambers with the sphere.

We can describe a chamber by specifying, for each $H \in \mathcal{A}$, which side of H the
chamber is on. To formalize this, write $\mathcal{A} = \{H_i\}_{i \in I}$ and let $H_i^+$ and $H_i^-$ be the
two open halfspaces determined by $H_i$. (The choice of which one to call $H_i^+$ is
arbitrary but fixed.) Then the chambers are precisely the nonempty sets of the
form

$$C = \bigcap_{i \in I} H_i^{\sigma_i},$$

where $\sigma_i = \pm$. The vector of signs $(\sigma_i)_{i \in I}$ provides a succinct description of C.
Figure 8 illustrates this (using the spherical picture) for the four planes x = 0,
y = 0, z = 0, z - x - y = 0. The region labeled + + + +, for instance, corresponds
to the chamber x > 0, y > 0, z > 0, z - x - y > 0. Notice that the sign
vectors of the remaining chambers can be filled in once + + + + has been
identified; just move from chamber to chamber, crossing one plane at a time and
recording the appropriate sign change.


Figure 8. Sign vectors for four planes in general position. 
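
The sign vectors of Figure 8 can be listed by evaluating the four linear forms at sample points. The sketch below is a shortcut for this particular arrangement, not a general algorithm: it assumes that points with odd integer coordinates in {-3, -1, 1, 3} hit every chamber, which holds here because the number found matches the count $f_2 = 14$ given in Section 1.1.

```python
from itertools import product

# Linear forms whose zero sets are the four planes of Figure 8.
FORMS = [
    lambda x, y, z: x,
    lambda x, y, z: y,
    lambda x, y, z: z,
    lambda x, y, z: z - x - y,
]

def sign_vector(point):
    """Sign vector of a point, one symbol per plane."""
    out = []
    for form in FORMS:
        value = form(*point)
        out.append("+" if value > 0 else "-" if value < 0 else "0")
    return "".join(out)

# Odd integer coordinates never lie on any of the four planes
# (z - x - y is then odd), so every sample is inside a chamber.
samples = product((-3, -1, 1, 3), repeat=3)
chambers = sorted({sign_vector(p) for p in samples})
```

Sign vectors such as `++-+` never appear, since x > 0, y > 0, z < 0 is incompatible with z - x - y > 0.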
3.2. Faces, products, and projections. The chambers are defined by finitely many
linear inequalities, so they are polyhedra. They have faces, which are the nonempty
sets obtained by changing zero or more inequalities to equalities. For example, the
chamber + + + + in Figure 8 has a face defined by x = 0, y > 0, z > 0, z - x - y > 0.
We say that this face has sign vector 0 + + +. In general, the sign vector of a
face F is the vector $(\sigma_i)$ such that


$$F = \bigcap_{i \in I} H_i^{\sigma_i},$$

where $\sigma_i \in \{+, -, 0\}$ and $H_i^0 = H_i$. In Figure 8, the chamber + + + + has three
faces of codimension 1, with sign sequences 0 + + +, + 0 + +, and + + + 0.
These correspond to the three sides of the triangle labeled + + + + in Figure 8.
Note that + + 0 + does not occur, because it would correspond to the inconsistent
set of conditions x > 0, y > 0, z = 0, z - x - y > 0.

Let $\mathcal{C}$ be the set of chambers and let $\mathcal{F}$ be the set of faces of chambers; note
that $\mathcal{C} \subseteq \mathcal{F}$ according to our conventions. Somewhat surprisingly, there is a
naturally defined product on $\mathcal{F}$ that makes it a semigroup: Given $F, G \in \mathcal{F}$ with
sign vectors $\sigma(F)$, $\sigma(G)$, their product FG is the face with sign vector


$$\sigma_i(FG) = \begin{cases} \sigma_i(F) & \text{if } \sigma_i(F) \ne 0, \\ \sigma_i(G) & \text{if } \sigma_i(F) = 0. \end{cases}$$

This has a geometric interpretation: If we move on a straight line from a point of F
toward a point of G, then FG is the face we are in after moving a small positive
distance.

Given $F \in \mathcal{F}$ and $C \in \mathcal{C}$, their product FC is again a chamber, called the
projection of C on F. It can be characterized as the nearest chamber to C having
F as a face. Here "nearest" refers to the number of hyperplanes in $\mathcal{A}$ separating C
from FC. This proves the existence of the projection operators that were used to
construct the random walk in Section 1.
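
The product rule translates directly into code on sign vectors written as strings. A minimal sketch; the function names and the string encoding are choices made for this example, and the sample face and chamber are taken from the four-plane arrangement of Figure 8.

```python
def face_product(f, g):
    """sigma_i(FG) = sigma_i(F) if it is nonzero, else sigma_i(G)."""
    return "".join(a if a != "0" else b for a, b in zip(f, g))

def separation(c1, c2):
    """Number of planes separating two chambers (positions where signs differ)."""
    return sum(a != b for a, b in zip(c1, c2))

# Sign vectors for the planes x = 0, y = 0, z = 0, z - x - y = 0.
F = "0+++"            # the face x = 0, y > 0, z > 0, z - x - y > 0
C = "----"            # the chamber antipodal to ++++
FC = face_product(F, C)
```

Here `FC` has F as a face and agrees with C on the one plane containing F, so no chamber with F as a face is separated from C by fewer planes.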

We continue to identify chambers and faces with their intersections with the 
sphere. Thus we have a cell decomposition of the sphere, with a semigroup 
structure on the set of cells. The next subsection illustrates the use of sign vectors 
and the semigroup structure. 


3.3. The diameter of the walk in $\mathbb{R}^3$. Consider an arrangement in $\mathbb{R}^3$ and the walk
on regions defined in Section 1. It is easy to see that there is an integer l > 0 such
that $K^l(C, C') > 0$ for all C, C'; in other words, one can get from any region to
any other in l steps. The smallest such l is called the diameter of the walk. As an
example, consider the arrangement of three planes shown (in its spherical representation)
in Figure 1. By inspection, to get from a chamber to its antipode by
steps of the walk (through a choice of vertices) takes three steps, and this is in fact
the diameter.


Proposition 2. The diameter of the walk based on n planes in $\mathbb{R}^3$ is always either 2 or
3. It is 2 if $n \ge 5$ and the planes are in general position.


Proof: One can never get from a region C to its antipode -C in one step; in fact,
each vertex v has at least two 0's in its sign vector, so multiplying by v cannot
change all the signs of C. This shows that the diameter is at least two.

Consider two regions C, C', with C' an m-gon. Assume, to simplify notation,
that the sign vector $\sigma'$ of C' has $\sigma'_j = +$ for all j and that the first m components
correspond to the m sides of C'. Then C' is the unique region whose sign vector
has + in its first m positions. If $m \ge 4$, we can find vertices v, w of C' such that v
is on two of the sides of C' and w is on two different sides. Thus the sign vector of
v has 0 in two of the first m slots and + in the rest, and similarly for w, with the
0's of v and w in disjoint positions. The product vwC in the semigroup $\mathcal{F}$ therefore
has + in its first m positions, i.e., vwC = C', and we can get from C to C' in two
steps. This does not work if C' is a triangle (m = 3), but in that case we can get
from any C to C' by using all three of the vertices of C'. Thus we have proved that
the diameter is at most 3.

Finally, suppose that the planes are in general position, that m = 3, and that
$n \ge 5$. Let w be a vertex on two of the great circles other than the three forming
the sides of C'. In view of general position, the first 3 components of the sign
vector of w are nonzero. Replacing w by -w if necessary, we may assume that
two of these 3 components, say the first two, are +. Let v be the vertex of C'
whose sign vector has 0 in the first two slots. Then vw has + everywhere, so we
can get from C to C' in two steps. ∎
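
For the three coordinate planes the diameter can be found by brute force. The sketch below tracks which regions are reachable in exactly l steps, using the sign-vector product of Section 3.2; the helper names are hypothetical, and the search is only practical for small arrangements.

```python
from itertools import product

def face_product(f, g):
    """Sign-vector product: keep f's sign unless it is '0'."""
    return "".join(a if a != "0" else b for a, b in zip(f, g))

# Three coordinate planes: 8 chambers and 6 vertices (two zeros each).
chambers = ["".join(s) for s in product("+-", repeat=3)]
vertices = [v for v in ("".join(s) for s in product("+-0", repeat=3))
            if v.count("0") == 2]

def reachable_in(steps, start):
    """Set of chambers the walk can occupy after exactly `steps` moves."""
    current = {start}
    for _ in range(steps):
        current = {face_product(v, c) for c in current for v in vertices}
    return current

def diameter():
    """Smallest l with K^l(C, C') > 0 for all pairs of chambers."""
    l = 1
    while any(reachable_in(l, c) != set(chambers) for c in chambers):
        l += 1
    return l
```

Calling `diameter()` for this arrangement gives 3, in agreement with the inspection of Figure 1 described in the text.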


4. HYPERPLANE ARRANGEMENTS. This section and the next are intended to 
provide some context for the seemingly strange random walk introduced in Section 
1. We consider here arrangements of hyperplanes in spaces of arbitrary finite 
dimension d. The reader who wants to proceed to the proof of the main theorem 
may skip ahead to Section 6. 

A good reference for the theory of hyperplane arrangements is the book by 
Orlik and Terao [20]. See also [19, 4, 23] and, for a concise summary of the basic 
concepts, [6, §2]. Throughout this section $\mathcal{A} = \{H_i\}_{i \in I}$ denotes a finite set of linear
hyperplanes (subspaces of codimension 1) in a finite dimensional real vector space
V. One often assumes that the intersection of the hyperplanes is the trivial
subspace, as we have been assuming when $V = \mathbb{R}^3$. There is no loss of generality in
making this assumption; for if it fails, then we can replace V by the quotient space
$V/V_0$, where $V_0 = \bigcap_{i \in I} H_i$.


4.1. Chambers, faces, and products. The open regions into which V is cut by $\mathcal{A}$ 
are again called chambers. More precisely, the chambers are the connected 
components of the complement of $\bigcup_{i \in I} H_i$ in V. As in the 3-dimensional case, 
chambers and their faces are encoded by sign vectors $(\sigma_i)_{i \in I}$, where $\sigma_i \in \{+, -, 0\}$. 
The definition of the product of faces also remains unchanged, so we have a face 
semigroup $\mathcal{F}$ containing the set $\mathcal{C}$ of chambers, with $FC \in \mathcal{C}$ for $F \in \mathcal{F}$, $C \in \mathcal{C}$. 
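
The product used throughout can be sketched in a few lines of Python. This is only an illustration: it assumes the product defined earlier in the paper is the usual componentwise rule on sign vectors (the i-th sign of FG is that of F unless it is 0, in which case it is that of G), and the sign vectors `F` and `C` below are hypothetical data, not taken from any figure.

```python
def face_product(f, g):
    """Componentwise product of sign vectors (entries +1, -1, 0).

    Assumed rule: keep the sign of f wherever it is nonzero,
    otherwise fall back to the sign of g.
    """
    return tuple(fi if fi != 0 else gi for fi, gi in zip(f, g))


def is_chamber(f):
    """A sign vector encodes a chamber exactly when it has no zero entry."""
    return all(s != 0 for s in f)


# Hypothetical example: a face F lying on the first two hyperplanes, and a chamber C.
F = (0, 0, 1, -1)
C = (-1, 1, 1, 1)
FC = face_product(F, C)  # F times a chamber is again a chamber
print(FC, is_chamber(FC))
```

Under this rule the product is associative and every face is idempotent, which is what makes the set of faces a semigroup acting on the chambers.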


Remark. The faces are in 1-1 correspondence with their intersections with the 
unit sphere. If $\bigcap_{i \in I} H_i = \{0\}$, these intersections give a cell-decomposition of the 
sphere, as in the case $V = \mathbb{R}^3$. 


4.2. A walk on the chambers. We may specify a random walk on $\mathcal{C}$ by assigning a 
weight $w_F$ to each $F \in \mathcal{F}$. These weights satisfy $w_F \ge 0$ (many may be zero) and 
$\sum_{F \in \mathcal{F}} w_F = 1$. Given a starting chamber $C_0$, the walk proceeds by choosing from $\mathcal{F}$ 
with replacement, with $w_F$ the chance of picking F each time. This generates 
choices $F_1, F_2, F_3, \ldots$. The walk proceeds by multiplying by $F_i$ at stage i, i.e., by 
projecting onto $F_i$. Thus it goes 

$$C_0,\quad F_1C_0,\quad F_2F_1C_0,\quad F_3F_2F_1C_0,\quad \ldots$$

The chance of moving from C to C’ in one step is 

$$K(C, C') = \sum_{F :\, FC = C'} w_F. \tag{5}$$

The walk of Section 1 is based on n planes in $\mathbb{R}^3$ with $w_F = 0$ unless F is a half 
line (so that its intersection with the sphere is a point), and $w_F = 1/f_0$ for all half 
lines. 


4.3. Analysis of the walk. As we remarked in the introduction, the hyperplane 
walks admit a rather complete theory. Bidigare, Hanlon, and Rockmore [2] give all 
the eigenvalues of the matrix K in a simple closed form. (We stated a special case 
of their result in Section 1.) Brown and Diaconis [6] prove diagonalizability of K 
and calculate eigenvalues using a chain complex given by the decomposition of the 
sphere by the hyperplanes. Brown [5] translates the walk to a semigroup setting 
and works out the appropriate Fourier analysis to get another proof of these 
results. The following result from [6] describes the stationary distribution and gives 
a bound on the convergence rate: 


Theorem 2. Let $\mathcal{A}$ be a hyperplane arrangement. Let $\{w_F\}$ be a probability distribution 
on the set $\mathcal{F}$ of faces. Let K(C, C’) be defined by (5). Then 

(a) K has a unique stationary distribution $\pi$ if and only if the weights $\{w_F\}$ are not 
concentrated on the faces in a single hyperplane, i.e., if and only if for each 
$H \in \mathcal{A}$ there is $F \not\subseteq H$ with $w_F > 0$. 

(b) If the condition in (a) holds, then $\pi$ may be described as follows: Sample 
without replacement from $\mathcal{F}$ according to the weights $w_F$. This generates an 
ordering $F_1, F_2, \ldots, F_m$ of $\{F \in \mathcal{F} : w_F > 0\}$. The product $F_1F_2 \cdots F_mC_0$ is 
independent of $C_0$ and is a chamber distributed according to $\pi$. 

(c) If the condition in (a) holds, then for any $C_0 \in \mathcal{C}$ and positive integer l, 

$$\|K^l_{C_0} - \pi\| \le \sum_{H \in \mathcal{A}} \lambda_H^l, \quad \text{where } \lambda_H = \sum_{F \subseteq H} w_F.$$


Part (c) of Theorem 2 immediately gives the convergence rate bound stated in 
Section 1. And part (b) of Theorem 2 can be used to get the explicit formula for $\pi$ 
in the 3-dimensional case, as stated in Theorem 1. Indeed, we first discovered the 
formula by doing exactly that. It turns out, however, that one can give an easier 
and completely self-contained proof of Theorem 1 by a direct argument based on 
equation (1). That is what we do in Section 7. 


5. TWO EXAMPLES. There are many hyperplane arrangements where the cham- 
bers can be labeled in a natural way by familiar combinatorial objects such as 
permutations or trees and the walk described in Section 4.2 captures a natural 
mixing process. In this section we describe two such examples. The first is the 
braid arrangement, for which the chambers correspond to permutations. The 
action of faces on chambers gives natural shuffling schemes, such as the usual 
method of riffle shuffling a deck of cards, or a list rearrangement scheme used in 
computer science where a card is removed and replaced on top. The second 
example shows how hyperplane arrangements are related to tilings. In particular, 
we describe in some detail a hyperplane arrangement in $\mathbb{R}^5$ whose chambers 
correspond to rhombic tilings of a 10-gon. See [2, 6] for further examples. 
5.1. The braid arrangement and card shuffling. The braid arrangement in $\mathbb{R}^d$ 
consists of the $\binom{d}{2}$ hyperplanes $H_{ij} = \{(x_1, \ldots, x_d) : x_i = x_j\}$ ($i < j$). The chambers 
are associated with a common ordering of the coordinates and so with one of 
the d! permutations. When d = 4, for example, one of the 24 chambers is the 
region defined by $x_1 > x_4 > x_2 > x_3$, corresponding to the permutation 1423. 

The hyperplanes $H_{ij}$ intersect in the line $x_1 = \cdots = x_d$. The braid arrangement 
therefore gives rise to an arrangement in a (d − 1)-dimensional space, as explained 
at the beginning of Section 4. When d = 4, the resulting arrangement of 
6 planes in $\mathbb{R}^3$ may be pictured as in Figure 9. The great circle corresponding to 
$H_{ij}$ is labeled ij. (The equator is not one of the great circles of the arrangement.) 
Each chamber is labeled with the associated permutation. 


Figure 9. The braid arrangement when d = 4. 


The faces of a chamber C are obtained by changing to equalities some of the 
inequalities defining C. For example, the chamber $x_2 > x_3 > x_1 > x_4$ has a face 
given by $x_2 > x_3 > x_1 = x_4$, which is also a face of the chamber $x_2 > x_3 > x_4 > x_1$. 
This common face is represented by the edge between 2314 and 2341 in Figure 9. 
Similarly, the vertex labeled 2 in the figure corresponds to the face $x_2 > x_1 = x_3 = x_4$; 
it is a face of six chambers, corresponding to the six possible orderings of the 
indices 1, 3, 4. 

It is useful to encode the system of equalities and inequalities defining a face F 
by an ordered partition $(B_1, \ldots, B_k)$ of {1, ..., d}. Here $B_1, \ldots, B_k$ are disjoint 
nonempty sets whose union is {1, ..., d}; they are called the blocks of the partition, 
and their order counts. For example, the face $x_2 > x_3 > x_1 = x_4$ corresponds to 
the 3-block ordered partition ({2}, {3}, {1, 4}), and the face $x_2 > x_1 = x_3 = x_4$ [vertex 
2 in Figure 9] corresponds to the 2-block ordered partition ({2}, {1, 3, 4}). Notice 
that there is also a (unique) 1-block ordered partition, corresponding to the face 
$x_1 = x_2 = x_3 = x_4$. When we pass from $\mathbb{R}^4$ to a 3-dimensional quotient to make the 
hyperplanes have trivial intersection, this face becomes {0}. It does not show up in 
Figure 9 because its intersection with the sphere is empty. 

The action of faces on chambers is easily pictured by thinking of a permutation 
$\tau = (\tau_1, \ldots, \tau_d)$ as the set of labels on a deck of d cards, with the card labeled $\tau_1$ 
on top, and so on. The ordered partition $B = (B_1, B_2, \ldots)$ operates on $\tau$ by 
removing cards with labels in $B_1$ and placing them on top (keeping them in the 
same relative order), then removing cards with labels in $B_2$ and placing them 
next, and so on. Suppose, for example, that d = 10, $\tau$ = (1, 7, 3, 9, 10, 4, 5, 2, 6, 8), 
and B = ({2, 5}, {3, 4, 6, 10}, {7}, {1, 8, 9}); then B acting on $\tau$ gives 
(5, 2, 3, 10, 4, 6, 7, 1, 9, 8). 
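
The action just described is short enough to check by machine. The following Python sketch is illustrative only; the function name `act` is ours, and it represents a deck as a list with the top card first and an ordered partition as a list of sets.

```python
def act(blocks, deck):
    """Apply an ordered partition to a deck (top card first).

    Cards whose labels lie in the first block are moved to the top,
    keeping their relative order, then those in the second block, and so on.
    """
    return [card for block in blocks for card in deck if card in block]


deck = [1, 7, 3, 9, 10, 4, 5, 2, 6, 8]
blocks = [{2, 5}, {3, 4, 6, 10}, {7}, {1, 8, 9}]
result = act(blocks, deck)
print(result)
```

Run on the example from the text, this reproduces the deck (5, 2, 3, 10, 4, 6, 7, 1, 9, 8).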

The walk on chambers described in Section 4 now yields a variety of shuffling 
schemes, depending on the choice of weights $w_F$. One example that has received 
much attention is the “Tsetlin library,” or “random-to-top” shuffle. Here one 
assigns a positive weight $w_i$ to each 2-block ordered partition $(\{i\}, \{1, \ldots, d\} \setminus \{i\})$ 
and weight 0 to the other faces. When d = 4, for example, the vertices with 
positive weight are those labeled 1, 2, 3 in Figure 9 and the one that would be 
labeled 4 if it were visible. [The opposite vertex is shown as an open circle.] In the 
resulting random walk, a card is repeatedly picked at random according to the 
weights $w_i$ and is replaced on top. This walk is useful in connection with 
self-organizing list-management schemes [9]. Imagine, for instance, a stack of d 
files, where file i is used with frequency $w_i$. Each time a file is used, it is replaced 
on top of the stack. After the process has been running for a long time, the most 
frequently used files will tend to be near the top. 

The basic walk studied in this paper, where the vertices are weighted uniformly, 
is itself quite interesting when applied to the braid arrangement. Here, we assign 
weight $1/(2^d - 2)$ to each of the $2^d - 2$ two-block ordered partitions. The corresponding 
shuffling mechanism consists of “inverse riffle shuffles.” In an ordinary 
riffle shuffle a deck of cards is divided into two piles, which are riffled together. 
The inverse chooses a set S of cards, which are removed (“unriffled”) and placed 
on top; the $2^d - 2$ proper nonempty subsets S are all equally likely to be unriffled. 
This is very closely related to a standard model for riffle shuffling. See [2, 6] for 
further details and references to earlier work. 
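
Both shuffles are instances of the action of a two-block ordered partition, so one step of each can be simulated in a few lines. The sketch below is a hedged illustration, not code from the paper: the names `inverse_riffle_step` and `random_to_top` are ours, the deck is a list with the top card first, and the uniform subset is drawn by simple rejection sampling.

```python
import random


def act(blocks, deck):
    """Move the cards of each block to the front in turn, keeping relative order."""
    return [card for block in blocks for card in deck if card in block]


def inverse_riffle_step(deck, rng):
    """Unriffle a uniformly random proper nonempty subset S of the cards."""
    while True:
        # Each card joins S independently with probability 1/2; rejecting the
        # empty set and the full deck leaves S uniform on the other 2^d - 2 subsets.
        S = {card for card in deck if rng.random() < 0.5}
        if 0 < len(S) < len(deck):
            break
    return act([S, set(deck) - S], deck)


def random_to_top(deck, weights, rng):
    """Tsetlin library step: pick card i with probability weights[i], move it to the top."""
    i = rng.choices(deck, weights=[weights[c] for c in deck])[0]
    return act([{i}, set(deck) - {i}], deck)


rng = random.Random(0)  # fixed seed so that runs are repeatable
deck = list(range(1, 9))
for _ in range(5):
    deck = inverse_riffle_step(deck, rng)
print(deck)
```

Starting from a sorted deck, a single inverse riffle step always produces at most one descent, which is a quick sanity check on the implementation.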

To conclude, let us apply the results of Section 1.3 to the walk when d = 4 and 
the vertices are chosen uniformly. There are 14 vertices, 36 edges, and 24 regions. 
The stationary distribution is uniform because each of the regions has three sides. 
The eigenvalues and multiplicities $(\lambda, m_\lambda)$ are (1, 1), (3/7, 6), (1/7, 11), and (0, 6). After 
shuffling l times, the distance to stationarity satisfies 

$$\|K^l - \pi\| \le 6\left(\frac{3}{7}\right)^l.$$


This bound shows that l = 5 shuffles suffice to make the distance to stationarity 
smaller than 1/10. 
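
The arithmetic behind this claim can be checked exactly. The sketch below assumes, as a reading of Theorem 2(c) for this example, that each of the six great circles passes through 6 of the 14 vertices, so that every $\lambda_H$ equals 6/14 = 3/7 and the bound is $6(3/7)^l$; the variable names are ours.

```python
from fractions import Fraction

# Assumption: 6 great circles, each carrying 6 of the 14 equally weighted vertices.
num_circles = 6
lam = Fraction(6, 14)


def bound(l):
    """Upper bound on the distance to stationarity after l shuffles."""
    return num_circles * lam ** l


# Smallest l for which the bound drops below 1/10.
first = next(l for l in range(1, 50) if bound(l) < Fraction(1, 10))
print(first, float(bound(first - 1)), float(bound(first)))
```

Exact rational arithmetic avoids any rounding question near the threshold 1/10.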


5.2. Arrangements and tilings. We discuss now an example of an arrangement in 
$\mathbb{R}^5$ in which the chambers correspond to certain tilings of a centrally symmetric 
10-gon. The walks described in this paper can then be used to generate random 
tilings. Such tilings are of interest to physicists who study quasicrystals; see [8, 16] 
and the references cited there. This is an example of a very general theory whereby 
hyperplane arrangements can be associated with a set of tilings of special convex 
polytopes called zonotopes [3, §4]. A good introductory reference for the material 
in this subsection is [23, Lecture 7]. 

Given n pairwise linearly independent vectors $v_1, \ldots, v_n \in \mathbb{R}^d$, we define a 
convex polytope known as a zonotope by 

$$Z(v_1, \ldots, v_n) := \Big\{ \sum_{i=1}^{n} \lambda_i v_i : 0 \le \lambda_i \le 1 \Big\}.$$

Figure 10. Five vectors in $\mathbb{R}^2$ and the 10-gon they generate. 


We call the $v_i$ the zones of $Z(v_1, \ldots, v_n)$. When d = 2, $Z(v_1, \ldots, v_n)$ is a centrally 
symmetric 2n-gon; this is illustrated in Figure 10 for the case d = 2 and n = 5, 
where the resulting zonotope is a 10-gon. 

We are interested in tilings of this 10-gon by the parallelograms generated by 
the $\binom{5}{2} = 10$ pairs $v_i, v_j$, $1 \le i < j \le 5$. Since we may take all the $v_i$ to have the 
same length without loss of generality, we call these tilings rhombic tilings. Two 
such rhombic tilings are illustrated in Figure 11. It is a consequence of the work of 
Billera and Sturmfels [3] that the set of all such tilings of this 10-gon corresponds 
to the chambers of an arrangement of $\binom{5}{3} = 10$ hyperplanes in $\mathbb{R}^5$. We describe 
this arrangement in some detail now. 


Figure 11. Two tilings of the 10-gon by parallelograms. 


Every triple $v_i, v_j, v_k$ of the 5 given vectors, $1 \le i < j < k \le 5$, satisfies a unique 
(up to nonzero scalar) linear dependence relation. We write it as a relation among 
$v_1, v_2, v_3, v_4, v_5$, so it is given by a vector of length 5. Let $\tau = \{i, j, k\}$ and let the 
dependence be given by $z^\tau \in \mathbb{R}^5$; thus $\sum_{l=1}^{5} z^\tau_l v_l = 0$ and $z^\tau_l = 0$ if $l \notin \tau$. The span 
V of the $z^\tau$, over all triples $\tau \subset \{1, 2, 3, 4, 5\}$, is the nullspace of the rank 2 matrix 

$$A = \begin{pmatrix} x_1 & x_2 & x_3 & x_4 & x_5 \\ y_1 & y_2 & y_3 & y_4 & y_5 \end{pmatrix},$$

where $v_i = (x_i, y_i)$, and so is 3-dimensional. For each triple $\tau$, we denote by 
$H^\tau \subset \mathbb{R}^5$ the hyperplane having $z^\tau$ as its normal (i.e., $H^\tau = \{x \in \mathbb{R}^5 \mid \langle x, z^\tau \rangle = 0\}$). 
The hyperplanes $\{H^\tau\}$ have a 2-dimensional intersection $V^\perp$ (the row space of A). 
We therefore get an arrangement in the 3-dimensional space $\mathbb{R}^5/V^\perp$ (which we 
may identify with V), as explained at the beginning of Section 4. This may be 
pictured as in Figure 12. Each great circle is labeled by the corresponding triple $\tau$. 
Note that the 4 great circles corresponding to triples $\tau \subset \{2, 3, 4, 5\}$ intersect at a 
pair of antipodal vertices, one of which is labeled 1 in Figure 12. This can be 
explained by straightforward linear algebra, which we leave to the interested 
reader. [Hint: Consider the matrix obtained from A by deleting the first column.] 
The other multiple intersections (with labels 2, ..., 5) can be explained in a similar 
way. 

Figure 12. The arrangement associated to tilings of a 10-gon. 

We claim that the chambers of this arrangement correspond to the rhombic 
tilings of the 10-gon. To understand why this might be so, take any point 
$\theta = (\theta_1, \ldots, \theta_5) \in \mathbb{R}^5$ not on any of the hyperplanes $H^\tau$. Lift the zones $v_i$ to 
$\hat{v}_i = (x_i, y_i, \theta_i)$, i = 1, ..., 5, and form the 3-dimensional zonotope $\hat{Z} := 
Z(\hat{v}_1, \ldots, \hat{v}_5) \subset \mathbb{R}^3$. One can show that each 2-dimensional face of $\hat{Z}$ is a parallelogram. 
[Brief explanation: One first observes that the 2-dimensional faces of a 
3-dimensional zonotope are translates of planar zonotopes spanned by subsets of 
the generating zones; see [23, p. 205]. These are all parallelograms unless three of 
the lifted zones $\hat{v}_i, \hat{v}_j, \hat{v}_k$ are linearly dependent. A linear dependence relation 
among these lifts would be a multiple of $z^\tau$, where $\tau = \{i, j, k\}$. This would imply 
$\theta \in H^\tau$, contradicting the choice of $\theta$.] Projecting the faces on the bottom of 
$\hat{Z}$ (i.e., the faces whose outer normal has a negative third coordinate) onto the 
10-gon $Z(v_1, \ldots, v_5)$, one gets a rhombic tiling of the 10-gon. Further argument 
shows that any other $\theta$ in the same chamber gives the same tiling and that all 
rhombic tilings of the 10-gon arise in this way. 

An example of what might happen if the original zonotope were a hexagon and 
the lifted zonotope a 3-cube is shown in Figure 13; depending on its orientation in 
$\mathbb{R}^3$, the bottom of the cube would look like one of the two tilings shown. 

A move between adjacent chambers in the arrangement in Figure 12 corresponds 
to a simple local change in the corresponding tiling: for some embedded 
“3-cube” in the tiling, flip just that portion of the tiling as shown in Figure 13. So 
the chamber corresponding to a tiling is an i-gon if and only if the tiling has i 
embedded 3-cubes; for example the tiling on the left in Figure 11 corresponds to a 
triangular region, while that on the right corresponds to a pentagonal region. 

Figure 13. A flip of a 3-cube. 

We note that the situation described here is a piece of a very general theory of 
subdivisions [3, 1]. In particular, given a d-dimensional zonotope Z with n zones in 
$\mathbb{R}^d$, one can associate with it an arrangement of at most $\binom{n}{d+1}$ hyperplanes in 
$\mathbb{R}^{n-d}$ whose chambers correspond to certain tilings of Z. In our example it turns 
out to be all the rhombic tilings, but, in general, not all tilings can be obtained by 
the lifting procedure described in the example; see [3] for details. When n = d + 3 
the derived arrangement is 3-dimensional, and so the theory of the present paper 
applies. 

To conclude, let us apply the results of Section 1.3 to our walk on the tilings of a 
10-gon. There are 40 vertices, 100 edges, and 62 regions, with $(p_3, p_4, p_5)$ = 
(50, 10, 2). The stationary probability of a tiling with i cubes is (i − 2)/76. The 
eigenvalues and multiplicities $(\lambda, m_\lambda)$ are (1, 1), (1/4, 10), (1/20, 30), and (0, 21). After l 
steps, the distance to stationarity satisfies 

$$\|K^l - \pi\| \le 10\left(\frac{1}{4}\right)^l.$$


6. A GEOMETRIC DESCRIPTION OF K. We return to the setting of Section 1, 
where $\mathcal{A}$ is a collection of planes in $\mathbb{R}^3$ with trivial intersection, and K is the 
transition matrix of the walk on the regions of the sphere (or, equivalently, on the 
chambers in $\mathbb{R}^3$). Recall that, in the notation of Section 1, 

$$K(C, C') = \frac{1}{f_0} A(C, C'), \tag{6}$$

where A(C, C’) is the number of vertices v of C’ such that vC = C’. Note that 
A(C, C) is the number of vertices of C and that A(C, −C) = 0. The main result of 
this section is the following formula for A(C, C’) when $C' \ne \pm C$: If k is the 
number of sides of C’ for which the sign vectors of C and C’ agree (that is, the 
number of bounding planes of C’ that do not separate C from C’), then 

$$A(C, C') = k - 1.$$


We prove this in Section 6.2 (Lemma 2). We begin by describing the matrix A in 
terms of lunes. 


6.1. Lunes. For a region D and a vertex v of D, let L(v, D) be the lune 
determined by v and D as in Figure 14. Thus if $H_1$ and $H_2$ are the great circles 
bounding D and passing through v, and if $H_i^+$ is the hemisphere defined by $H_i$ 
that contains D (i = 1, 2), then L(v, D) is the intersection $H_1^+ \cap H_2^+$. Given two 
regions C, C’, we define the lunacy of C with respect to C’ to be the number of 
vertices v of C’ such that $C \subseteq L(v, C')$. 

Figure 14. The lune L(v, D). 


Proposition 3. For any two regions C,C', A(C,C') is equal to the lunacy of C with 
respect to C’. 


Proof: Assume, to simplify notation, that the sign vector of C’ consists of all +’s. 
For a vertex v of C’, we have vC = C’ if and only if the sign vector of vC is + in 
the positions corresponding to the sides of C’. Now v has + in all of these 
positions except two, where it has 0; so vC = C’ if and only if C already has + in 
the two positions corresponding to the sides of C’ that intersect at v. But this says 
precisely that $C \subseteq L(v, C')$. ∎


Understanding K, then, reduces to calculating lunacy. Figure 15 shows a simple 
example; here C is in the lune L(v, C’) but not the other two C’-lunes, so 
A(C, C’) = 1. On the other hand, C’ is in two of the four C-lunes, so A(C’, C) = 2. 
Note the lack of symmetry. 


Figure 15. A(C,C’) = 1. 


There is a similar description of the transition matrix for the higher-dimensional 
hyperplane walks described in Section 4. One replaces the lunes by sets L(F, C), 
where C is a chamber, F is a face of C, and L(F, C) is an intersection of open 
half-spaces, one for each bounding hyperplane of C. What’s special about the 
3-dimensional case, however, is that lunacy is easy to compute. The computation 
depends on a simple geometric fact about polygons, related to “shellability.” 


6.2. Shellability and the calculation of lunacy. Consider a planar convex polygon 
P whose sides are extended to form lines $l_1, \ldots, l_n$. If the polygon is viewed from 
any point x outside of it, one sees a contiguous set of its edges, and these are 
precisely the edges whose lines separate x from P. See Figure 16; here it is 
assumed that P is on the positive side of each $l_i$, and the lines are labeled with ± 
to indicate which side x is on. 

Figure 16. The sides visible from x. 

All of this works equally well on the sphere: 


Lemma 1 (Shelling lemma). Given a spherical polygon bounded, in cyclic order, by n 
great circles $l_1, \ldots, l_n$, suppose that the interior of the polygon is on the positive side of 
each of these circles. Then any point on the sphere not on any of the circles is on the 
positive side of a cyclically contiguous subset of them. 


An analogue of the shelling lemma holds for convex polytopes in d-dimensional 
space [7] and for oriented matroids [18]. In the former case, contiguity is replaced 
by “shellability” of the set of visible faces, whence the name “shelling lemma.” The 
latter case implies that everything we have done works for arrangements of 
pseudocircles (Section 2.4). See [23] and [4] for discussions of all of these topics. 
We omit a proof of the shelling lemma because it is treated thoroughly in the 
references cited and because, in the present low-dimensional case, it is quite 
believable from the picture. 


Lemma 2 (Lunacy lemma). Given two regions C and C' that are neither equal nor 
opposite, we have 


$$A(C, C') = k - 1,$$


where k is the number of sides of C’ for which the sign vectors of C and C’ agree. 
Consequently, 


A(C,C') + A(-C,C’) =i-2 


if C’ is an i-gon. 


In Figure 15, for example, C’ has 3 sides, one of which separates C from C’; so 
k = 2 and A(C,C’) = 1, as we have already observed. 


Proof of the lunacy lemma. Assume that C’ has sign vector $+ + \cdots +$ and apply the 
shelling lemma to the polygon C’, with x equal to any interior point of C. It 
follows that the sign pattern of C, restricted to the sides of C’, is a cyclic 
permutation of the string $+ + \cdots + - - \cdots -$, where there are k +’s. Every pair 
of cyclically consecutive +’s corresponds to a lune of C’ that contains C. Since 
there are k − 1 such pairs, the lunacy A(C, C’) is indeed k − 1. To prove the 
second assertion of the lemma, note that for each of the i sides of C’ either C or 
−C has the same sign as C’. Thus the k’s for C and −C sum to i. ∎
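
The counting step in this proof is easy to check mechanically. The sketch below is only an illustration of that step: the function name `lunacy_from_signs` is ours, and a sign pattern is given as a string of '+' and '-' listing, in cyclic order around C’, the sign of C on each side of C’.

```python
def lunacy_from_signs(signs):
    """Count pairs of cyclically consecutive '+' signs.

    Each such pair corresponds to a vertex of C' whose lune contains C.
    """
    n = len(signs)
    return sum(signs[j] == '+' and signs[(j + 1) % n] == '+' for j in range(n))


def negate(signs):
    """Sign pattern of -C on the same sides."""
    return ''.join('-' if s == '+' else '+' for s in signs)


# Every cyclic shift of k plus signs followed by i - k minus signs, with 0 < k < i.
for i in range(3, 8):
    for k in range(1, i):
        base = '+' * k + '-' * (i - k)
        for shift in range(i):
            pattern = base[shift:] + base[:shift]
            assert lunacy_from_signs(pattern) == k - 1
            assert lunacy_from_signs(pattern) + lunacy_from_signs(negate(pattern)) == i - 2

print(lunacy_from_signs('++-'))
```

The all-plus pattern, which corresponds to C = C’, gives i rather than k − 1, in agreement with the remark that follows.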


Remark. If C=C’, then A(C,C’) =i, while if C = —C’ then A(C,C’) = 0. 
Hence A(C, C’) + A(−C, C’) = i rather than i − 2 if $C = \pm C'$. 


6.3. Convergence rate examples. Recall from Section 2.1 that the walk on regions 
is close to stationary after 2 steps for an arrangement of n planes in general 
position if n is large. Now that we understand the matrix K, we can give an 
example where the walk is not close to stationary after 1 step, as advertised in 
Section 2.1. In other words, we want $\|K_C - \pi\|$ to be bounded away from 0 for an 
infinite family of general position arrangements with larger and larger n. We use a 
family of arrangements $\mathcal{A}_n$ constructed by Füredi and Palásti [10], for which $p_3$, the 
number of triangles, is asymptotic to $2n^2/3$ as $n \to \infty$. (Recall from Section 2.1 
that $f_2$, the total number of regions, is about $n^2$. So there are about $n^2/3$ 
non-triangular regions. Most of these are hexagons.) Fix a starting region C. We 
claim that for any triangular region C’, either A(C, C’) = 0 or A(C, −C’) = 0. To 
see this, we may assume $C' \ne \pm C$. Then A(C, C’) + A(−C, C’) = 1 by the lunacy 
lemma, so one term is 0. Our claim now follows from the fact that A(−C, C’) = 
A(C, −C’). 

In view of the claim, there are $p_3/2$ triangular regions C’ that cannot be 
reached from C in one step. Our main theorem, however, implies that this set T of 
triangles has stationary probability 

$$\pi(T) = \frac{p_3}{2} \cdot \frac{1}{2(f_0 - 2)}.$$

The definition (2) of the distance between probability distributions now implies 
that $\|K_C - \pi\| \ge \pi(T)$, which tends to 1/6 as $n \to \infty$. Thus the walk for $\mathcal{A}_n$ is not 
close to stationary after one step. 


Remark. On the other hand, there is a naturally occurring family of arrangements 
$\mathcal{B}_n$ for which the walk is close to random after one step. Here $\mathcal{B}_n$ is the cyclic 
arrangement, consisting of n great circles that form the sides of an n-gon in the 
Northern Hemisphere. There is, of course, also an n-gon antipodal to this one in 
the Southern Hemisphere. $\mathcal{B}_n$ is a general position arrangement; for $n \ge 5$ there 
are 2n triangles and n(n − 3) quadrilaterals in addition to the two n-gons. The 
cases n = 3, 4, 5, 6 are shown in Figures 1, 2, 5, and 6. A straightforward but 
tedious calculation shows that, for any starting region C, $\|K_C - \pi\| \le c/n$ for c a 
universal constant; thus the walk for this family is close to random after one step 
when n is large. 


7. PROOF OF THEOREM 1. We are trying to calculate the stationary distribution 
of the walk with transition matrix given by (6). The assertion is that 

$$\pi(C) = \frac{i - 2}{2(f_0 - 2)} \tag{7}$$

if C is an i-gon. There are two ways to proceed. 
Method 1. As we remarked in the introduction, it suffices to show that the 
right-hand side of (7) is a probability distribution satisfying 

$$\sum_{C} \pi(C) K(C, C') = \pi(C') \tag{8}$$

for all C’. Now we already know, by the Euler relation, that the right-hand side 
sums to 1 (Section 2.2). So it suffices to show, for each C’, that 

$$\sum_{C} (i(C) - 2) K(C, C') = i(C') - 2,$$

where i(C) is the number of sides of C. Equivalently, we must show 

$$\sum_{C} (i(C) - 2) A(C, C') = f_0 (i(C') - 2). \tag{9}$$

Since i(−C) = i(C), we can replace A(C, C’) on the left by the average 

$$M(C, C') = \frac{A(C, C') + A(-C, C')}{2}$$

without changing the value of the sum. In view of the lunacy lemma, the left side of 
(9) becomes 

$$\sum_{C} (i(C) - 2) M(C, C') = \sum_{C} (i(C) - 2) \frac{i(C') - 2}{2} + 2(i(C') - 2),$$

where the second term takes account of the fact that $M(\pm C', C') = i(C')/2$ 
instead of $(i(C') - 2)/2$. Pulling out the factor i(C’) − 2, and recalling that 
$\sum_{C} (i(C) - 2) = 2(f_0 - 2)$, we obtain (9). 


Method 2. We know that $\pi$ exists and is characterized by (8), which can be 
rewritten 

$$\sum_{C} \pi(C) A(C, C') = f_0 \pi(C'). \tag{10}$$

Since A(C, C’) = A(−C, −C’), it follows that $\pi(-C) = \pi(C)$. We may therefore 
rewrite (10) as 

$$\sum_{C} \pi(C) M(C, C') = f_0 \pi(C'),$$

with M as in the first proof. Using the lunacy lemma (and taking account of the 
terms in the sum where $C = \pm C'$), the equation becomes 

$$\sum_{C} \pi(C) \frac{i(C') - 2}{2} + 2\pi(C') = f_0 \pi(C')$$

or, since $\sum_{C} \pi(C) = 1$, 

$$\frac{i(C') - 2}{2} + 2\pi(C') = f_0 \pi(C').$$

Solving for $\pi(C')$, we obtain 

$$\pi(C') = \frac{i(C') - 2}{2(f_0 - 2)},$$

as required. ∎


This second proof is somewhat more satisfying than the first, since equation (7) 
appears naturally, rather than being pulled out of a hat. Note also that the second 
proof did not use the Euler relation; it therefore has the Euler relation as a 
consequence. Note, finally, that Method 2 proves uniqueness of $\pi$ (though not 
existence) without appeal to the theory of Markov chains. 
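
Formula (7) can also be confirmed numerically on a small arrangement. The following Python sketch is an illustration under stated assumptions, not part of the proof: it takes four planes in general position with hypothetical integer normals, finds chambers by sampling integer grid points (adequate for this small example, but not a general method), obtains vertices from cross products of pairs of normals, and assumes the componentwise product on sign vectors. All function names are ours.

```python
from fractions import Fraction
from itertools import combinations, product


def sign(t):
    return (t > 0) - (t < 0)


def sign_vector(normals, p):
    """Signs of the point p with respect to each plane."""
    return tuple(sign(sum(a * b for a, b in zip(n, p))) for n in normals)


def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def face_product(f, g):
    """Assumed product rule: keep f's sign unless it is 0."""
    return tuple(fi if fi != 0 else gi for fi, gi in zip(f, g))


def chambers_and_vertices(normals, grid=3):
    # Chambers: sign vectors without zeros, found by sampling a small integer grid.
    points = product(range(-grid, grid + 1), repeat=3)
    chambers = {s for s in (sign_vector(normals, p) for p in points) if 0 not in s}
    # Vertices: the two rays along the intersection line of each pair of planes.
    vertices = set()
    for a, b in combinations(normals, 2):
        c = cross(a, b)
        vertices.add(sign_vector(normals, c))
        vertices.add(sign_vector(normals, tuple(-t for t in c)))
    return sorted(chambers), sorted(vertices)


def stationary_check(normals):
    chambers, vertices = chambers_and_vertices(normals)
    f0 = len(vertices)
    # v is a vertex of C exactly when vC = C; a spherical i-gon has i vertices.
    sides = {C: sum(face_product(v, C) == C for v in vertices) for C in chambers}
    pi = {C: Fraction(sides[C] - 2, 2 * (f0 - 2)) for C in chambers}

    def K(C, D):
        return Fraction(sum(face_product(v, C) == D for v in vertices), f0)

    ok = all(sum(pi[C] * K(C, D) for C in chambers) == pi[D] for D in chambers)
    return len(chambers), f0, sum(pi.values()), ok


# Hypothetical example: the coordinate planes together with x + y + z = 0.
normals = [(1, 0, 0), (0, 1, 0), (0, 0, 1), (1, 1, 1)]
print(stationary_check(normals))
```

Exact fractions are used so that the stationarity equation (8) is tested as an identity rather than up to floating-point error.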


Remark. We are mystified by the factor i — 2 in Theorem 1. To help appreciate 
the mystery (and Theorem 1) let us consider a different random walk on the 
chambers of a hyperplane arrangement, called the local walk. Suppose the walk is 
currently in region C bounded by i hyperplanes. Pick one of the bounding planes 
uniformly at random. The walk moves to the chamber adjacent to C along the 
chosen hyperplane. Call the transition matrix of this walk H(C, C’). If i(C) 
denotes the number of sides of C, then 

$$H(C, C') = \begin{cases} 1/i(C) & \text{if } C' \text{ is adjacent to } C, \\ 0 & \text{otherwise.} \end{cases}$$


For this walk, an i-sided chamber C has stationary probability proportional to 
i =i(C); moreover, this walk is reversible: i(C)H(C, C’) = i(C’)H(C',C). The 
more vigorous walk K(C, C’) that we have been studying is not reversible except in 
special cases. It is one of only a few families of nonreversible walks where the 
stationary distribution is known explicitly. For the braid arrangement of Section 
5.1, the local walk corresponds to mixing a deck of cards by adjacent transposi- 
tions, whereas the walk of this paper corresponds to the (much faster) riffle 
shuffling. For tilings (Section 5.2), physicists currently use the local walk (cube 
flips) for simulating a random tiling; see [22]. The walk described here may offer 
vast speed-ups. 


8. FINAL REMARKS. The plane arrangements in $\mathbb{R}^3$ that we have been 
considering in this paper occur in the literature under the name line arrangements. 
In fact, a plane through the origin in $\mathbb{R}^3$ is the same as a line in the projective 
plane $\mathbb{P}^2$, so we can think of $\mathcal{A}$ as an arrangement of lines in the plane. Recall that 
the points of $\mathbb{P}^2$ are the lines through the origin in $\mathbb{R}^3$. Since a line through the 
origin in $\mathbb{R}^3$ corresponds to a pair of antipodal points on the sphere $S^2$, we get a 
2-to-1 map $S^2 \to \mathbb{P}^2$. The great circles in our spherical pictures are the inverse 
images of the lines in $\mathbb{P}^2$ under this map. As a practical matter, if $\mathbb{P}^2$ is viewed as 
the affine plane with an extra line at infinity, one goes from the projective picture 
to the spherical picture by drawing the projective picture on the Northern Hemisphere, 
the line at infinity becoming the equator. 

For a simple example, consider the arrangement of four lines in $\mathbb{P}^2$ consisting 
of the three lines shown in Figure 17 together with the line at infinity. The 
corresponding spherical picture is Figure 2. A more complicated example of a 
projective picture was given in Figure 7. The cell-decomposition of $S^2$ that we 
have been using in this paper is obtained by lifting an analogous cell-decomposition 
of $\mathbb{P}^2$. Thus there is a pair of antipodal cells in $S^2$ for each cell in $\mathbb{P}^2$. 

Figure 17. Four lines in $\mathbb{P}^2$; the line at infinity is included. 

There is an extensive literature on line arrangements (and “pseudoline” arrangements) 
going back to the 1820’s. Much is known, but there are still many 
open questions; see [14, 11] for surveys. Some of the more interesting results and 
open questions concern the polygon counts $p_i$ ($i \ge 3$) where, following standard 
conventions, we now denote by $p_i$ the number of i-gons in the cell decomposition 
of $\mathbb{P}^2$; this is half of what was called $p_i$ in Section 2. A sample result is that for an 
arrangement in general position, one has 

$$p_3 = 4 + \sum_{i \ge 5} (i - 4)\, p_i. \tag{11}$$

(The reader can check that this equation holds for each column of Table 2, after 
the numbers in that table are cut in half.) Moreover, given any sequence 
$p_3, p_5, p_6, p_7, \ldots$ of natural numbers satisfying (11), one can always find a general 
position arrangement with those polygon counts. One has no control over $p_4$, 
however, and, indeed, very little is known about $p_4$. Equivalently, one has no 
control over the number of lines in the arrangement. Thus we are still very far 
from knowing, for a given n, what vectors $(p_3, p_4, \ldots, p_n)$ can occur. 
See [14, 11] for a wealth of further results, conjectures, and open questions. 
A Counting Formula for Primitive 
Tetrahedra in $\mathbb{Z}^3$ 


Mizan R. Khan 


1. INTRODUCTION. A primitive polytope is a polytope in n-dimensional Euclidean 
space $\mathbb{R}^n$ whose vertices are (integer) lattice points (points of $\mathbb{Z}^n$) and which 
contains no other lattice points in its interior or on its boundary. Pick’s 
theorem coupled with the fact that any primitive (convex) polygon has at most 4 
vertices (by Theorem 2.1) allows us to infer that a convex polygon in $\mathbb{Z}^2$ is primitive 
if and only if it is a triangle of area 1/2 or a parallelogram of area 1. In $\mathbb{Z}^3$ matters 
are very different. Specifically, there is no bound on the volume of a primitive 
tetrahedron (see Section 3); but there is an elegant characterization of primitive 
tetrahedra that was discovered more than 30 years ago. This characterization gives 
us a simple formula that counts the number of equivalence classes of primitive 
tetrahedra of a given volume. The goal of this paper is to state and prove this 
formula. 
We begin by defining the terms lattice polytope, vertex, and primitive. 


Definition 1.1. A lattice polytope is the closed convex hull in $\mathbb{R}^n$ of a finite set 
$\{u_1, \ldots, u_m\}$ of points of $\mathbb{Z}^n$, i.e., it is the set 

$$\{t_1u_1 + \cdots + t_mu_m : u_1, \ldots, u_m \in \mathbb{Z}^n,\ 0 \le t_1, \ldots, t_m \le 1,\ t_1 + \cdots + t_m = 1\}.$$

A vertex of a polytope is a point of the polytope that cannot be written as a convex 
combination of other points of the polytope. 


Remark. The lattice polytopes we consider always have non-zero volume. 


Definition 1.2. A lattice polytope in $\mathbb{R}^n$ is primitive if the only lattice points it 
contains are its vertices. 


Reeve ([8] and [9]) and Reznick [10] use the term fundamental instead of 
primitive; Scarf [13] uses the term integral polyhedron. Our preference for using the 
adjective primitive instead of fundamental or integral stems from the usage in [14]. 
The interested reader should look at [14, pp. 98-99] for a discussion of primitive 
sets of vectors and the connection with constructing a reduced basis of a lattice. 

Section 2 is a brief discussion of primitivity in $\mathbb{Z}^n$. We explain why a primitive 
polytope in $\mathbb{Z}^n$ has no more than $2^n$ vertices and why a primitive parallelepiped in 
$\mathbb{Z}^n$ has unit volume. Section 3 is a comparison of primitive triangles in $\mathbb{Z}^2$ with 
primitive tetrahedra in $\mathbb{Z}^3$. The goals of this section are to explain why there is no 
bound on the volume of primitive tetrahedra and to present a geometric characterization 
of primitive tetrahedra. In Section 4 we discuss unimodular maps and state 
an analytic version of the geometric characterization. Section 5 is where we discuss 
a formula that counts the number of equivalence classes of primitive tetrahedra for 
a fixed volume; the equivalence relation is defined via unimodular maps. The 
formula is in Theorem 5.3 and is proved with the aid of Burnside’s lemma. The 
final section describes Howe’s generalization of Theorem 4.2, which describes all 
primitive polyhedra, not just primitive tetrahedra. Except for the results in Section 
2, we confine our remarks to dimension 3 because the characterizations of 
primitive tetrahedra do not extend to higher dimensions. 

To conserve space we typically write lattice points as row vectors, but in our 
matrix calculations we view lattice points as column vectors. 


2. PRIMITIVITY IN $\mathbb{Z}^n$. We begin by showing that the number of vertices of a
primitive polytope is bounded, a result that appears as Theorem 1.2 in [13].


Theorem 2.1. A primitive polytope in $\mathbb{R}^n$ has at most $2^n$ vertices.


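Proof: If there are more than $2^n$ vertices in a lattice polytope P, then by the
pigeonhole principle (there are only $2^n$ parity classes) there are two distinct vertices
$v = (v_1, v_2, \ldots, v_n)$ and $w = (w_1, w_2, \ldots, w_n)$ such that $v_i \equiv w_i \pmod 2$,
$i = 1, \ldots, n$. The lattice point $(v + w)/2$ belongs to P and is not a vertex, so P is not primitive. ∎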


We now discuss lattice parallelepipeds in $\mathbb{R}^n$. We like to think of the lattice
points in a lattice parallelepiped as the cosets of a quotient group. Suppose P is
the parallelepiped

$$P = \{u + t_1 v_1 + t_2 v_2 + \cdots + t_n v_n : u, v_1, v_2, \ldots, v_n \in \mathbb{Z}^n,\ 0 \le t_1, t_2, \ldots, t_n \le 1\}.$$

We can consider the sublattice $\mathbb{Z}v_1 \oplus \mathbb{Z}v_2 \oplus \cdots \oplus \mathbb{Z}v_n$ and
view the lattice points in P as the cosets of the quotient group
$\mathbb{Z}^n/(\mathbb{Z}v_1 \oplus \mathbb{Z}v_2 \oplus \cdots \oplus \mathbb{Z}v_n)$. We have found the following result very useful.


Theorem 2.2. Let $v_1, \ldots, v_n$ be a set of linearly independent points of $\mathbb{Z}^n$. Then the
quotient group $\mathbb{Z}^n/(\mathbb{Z}v_1 \oplus \mathbb{Z}v_2 \oplus \cdots \oplus \mathbb{Z}v_n)$ has order $|\det A|$, where A is the $n \times n$
matrix whose k-th column is $v_k$, i.e., the order equals the volume of the parallelepiped
spanned by $v_1, v_2, \ldots, v_n$.

For a proof see [14, Theorem 20, p. 49]. Let us illustrate Theorem 2.2 and the
remarks preceding it by examining the parallelogram P in $\mathbb{Z}^2$ whose vertices are
(0, 0), (1, 3), (2, 2), and (3, 5).

The area of P equals 4. Therefore the quotient group $\mathbb{Z}^2/(\mathbb{Z}(1, 3) \oplus \mathbb{Z}(2, 2))$ has
order 4.


Figure 1. The parallelogram with vertices (0, 0), (1, 3), (2, 2), and (3, 5).

The 8 lattice points in P represent the cosets of $\mathbb{Z}^2/(\mathbb{Z}(1, 3) \oplus \mathbb{Z}(2, 2))$ in the
following way: the zero coset can be represented by any one of the 4 vertices; the
lattice points (1, 1) and (2, 4) represent the same coset (this coset is its own
inverse); the lattice points (1, 2) and (2, 3) represent distinct cosets with one being
the inverse of the other. Clearly, the quotient group is isomorphic to $\mathbb{Z}/4\mathbb{Z}$.
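
The coset bookkeeping in this example can be checked mechanically. The following Python sketch (the function names are ours, not the article's) lists the lattice points of the parallelogram spanned by (1, 3) and (2, 2), writes each one as $t_1 v_1 + t_2 v_2$ with exact rational coefficients, and groups together the points whose coefficients differ by integers, since those are exactly the points that differ by an element of the sublattice.

```python
from fractions import Fraction
from itertools import product


def lattice_points(v1, v2):
    """Map each lattice point of {t1*v1 + t2*v2 : 0 <= t1, t2 <= 1} to (t1, t2)."""
    det = v1[0] * v2[1] - v1[1] * v2[0]
    if det == 0:
        raise ValueError("v1 and v2 must be linearly independent")
    xs = [0, v1[0], v2[0], v1[0] + v2[0]]
    ys = [0, v1[1], v2[1], v1[1] + v2[1]]
    points = {}
    for x, y in product(range(min(xs), max(xs) + 1), range(min(ys), max(ys) + 1)):
        # Cramer's rule for (x, y) = t1*v1 + t2*v2, kept exact with Fraction.
        t1 = Fraction(x * v2[1] - y * v2[0], det)
        t2 = Fraction(v1[0] * y - v1[1] * x, det)
        if 0 <= t1 <= 1 and 0 <= t2 <= 1:
            points[(x, y)] = (t1, t2)
    return points


def cosets(v1, v2):
    """Group the lattice points by the fractional parts of (t1, t2)."""
    groups = {}
    for point, (t1, t2) in lattice_points(v1, v2).items():
        groups.setdefault((t1 % 1, t2 % 1), []).append(point)
    return groups


if __name__ == "__main__":
    for key, members in sorted(cosets((1, 3), (2, 2)).items()):
        print(key, sorted(members))
```

Under these assumptions the grouping should reproduce the description above: one class containing the four vertices, one containing (1, 1) and (2, 4), and two singleton classes for the interior points.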

From Theorem 2.2 and our observation about viewing lattice points as cosets,
we can easily prove the following theorem about parallelepipeds in $\mathbb{Z}^n$.


Theorem 2.3. The parallelepiped $P = \{u + t_1 v_1 + t_2 v_2 + \cdots + t_n v_n : u, v_1, v_2, \ldots, v_n \in \mathbb{Z}^n,\ 0 \le t_1, t_2, \ldots, t_n \le 1\}$
is primitive if and only if volume(P) = 1.


3. PRIMITIVE TRIANGLES vs. PRIMITIVE TETRAHEDRA. We now compare
primitive triangles in $\mathbb{Z}^2$ with primitive tetrahedra in $\mathbb{Z}^3$. Any primitive triangle in
$\mathbb{Z}^2$ has area $\tfrac{1}{2}$. This is the simplest case of Pick’s theorem.


Theorem 3.1 [Pick [7]]. Let P be a lattice polygon in $\mathbb{Z}^2$. If there are I lattice points
in the interior of P and B lattice points on the boundary of P, then
$\operatorname{area}(P) = I + \tfrac{1}{2}B - 1$.


We refer the reader to [1] for two elementary and elegant proofs of Theorem 
3.1; [2] and [3] are two recent expositions on Pick’s theorem in the MONTHLY. 

Reeve [8] observed that the tetrahedron with vertices (0, 0, 0), (1, 0, 0), (0, 1, 0),
and (1, 1, c) is primitive for any non-zero integer c. Since this tetrahedron has
volume |c|/6, there is no bound on the volume of a primitive tetrahedron and a direct
generalization of Pick’s theorem to $\mathbb{R}^3$ is impossible. However, by replacing $\mathbb{Z}^3$ with the fractional
lattice $\tfrac{1}{n}\mathbb{Z}^3$ for a fixed positive integer n, he was able to find a three-dimensional
analogue of Pick’s theorem; we do not pursue this direction. In addition to [8] and
[9], the interested reader should look at [6], where Reeve’s theorem is extended to
higher dimensions. Other related references are listed on the first page of [2].
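
Reeve's observation is easy to test by brute force. The sketch below is our own illustration, not part of the cited work: it enumerates the lattice points in the bounding box of a tetrahedron, computes barycentric-type coordinates by Cramer's rule (scaled by the determinant so that everything stays in integers), and declares the tetrahedron primitive when the only lattice points found are its four vertices. The helper names `det3`, `lattice_points_in_tetrahedron`, and `is_primitive` are hypothetical.

```python
from itertools import product


def det3(a, b, c):
    """Determinant of the 3x3 matrix whose columns are a, b, c."""
    return (a[0] * (b[1] * c[2] - b[2] * c[1])
            - b[0] * (a[1] * c[2] - a[2] * c[1])
            + c[0] * (a[1] * b[2] - a[2] * b[1]))


def sub(p, q):
    return tuple(pi - qi for pi, qi in zip(p, q))


def lattice_points_in_tetrahedron(vertices):
    """All lattice points of the closed tetrahedron with the given 4 integer vertices."""
    p0, p1, p2, p3 = vertices
    e1, e2, e3 = sub(p1, p0), sub(p2, p0), sub(p3, p0)
    d = det3(e1, e2, e3)
    if d == 0:
        raise ValueError("degenerate tetrahedron")
    sign = 1 if d > 0 else -1
    d = abs(d)  # equals 6 * volume
    box = [range(min(v[i] for v in vertices), max(v[i] for v in vertices) + 1)
           for i in range(3)]
    found = []
    for q in product(*box):
        w = sub(q, p0)
        # s_i = |d| * t_i, where q = p0 + t1*e1 + t2*e2 + t3*e3.
        s1 = sign * det3(w, e2, e3)
        s2 = sign * det3(e1, w, e3)
        s3 = sign * det3(e1, e2, w)
        if min(s1, s2, s3) >= 0 and s1 + s2 + s3 <= d:
            found.append(q)
    return found


def is_primitive(vertices):
    return sorted(lattice_points_in_tetrahedron(vertices)) == sorted(vertices)


if __name__ == "__main__":
    for c in (1, 2, 5, -7, 12):
        reeve = [(0, 0, 0), (1, 0, 0), (0, 1, 0), (1, 1, c)]
        print(c, is_primitive(reeve))
```

The search is exhaustive over the bounding box, so it is only practical for small coordinates, but it is enough to spot-check the claim for a range of values of c.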

Another way to show that primitive triangles in $\mathbb{Z}^2$ have area equal to $\tfrac{1}{2}$ is to use
Theorem 2.3. It is instructive to see how the argument proceeds and why it cannot
be extended to give an upper bound on the volume of primitive tetrahedra.

Suppose $T = \{u + t_1 v_1 + t_2 v_2 : u, v_1, v_2 \in \mathbb{Z}^2,\ 0 \le t_1, t_2 \le 1,\ t_1 + t_2 \le 1\}$ is a
lattice point triangle containing the lattice point w. Let x be the coset of
$\mathbb{Z}^2/(\mathbb{Z}v_1 \oplus \mathbb{Z}v_2)$ that w represents. There is at least one lattice point in the triangle
$\{u + t_1 v_1 + t_2 v_2 : u, v_1, v_2 \in \mathbb{Z}^2,\ 0 \le t_1, t_2 \le 1,\ t_1 + t_2 \ge 1\}$ that represents the inverse
$-x$. It follows that T is primitive if and only if the associated parallelogram
$P = \{u + t_1 v_1 + t_2 v_2 : u, v_1, v_2 \in \mathbb{Z}^2,\ 0 \le t_1, t_2 \le 1\}$ is primitive. Using Theorem 2.3
we conclude that T is primitive if and only if $\operatorname{area}(T) = \tfrac{1}{2}$.

Now let $T_1$ be the tetrahedron

$$T_1 = \{u + t_1 v_1 + t_2 v_2 + t_3 v_3 : u, v_1, v_2, v_3 \in \mathbb{Z}^3,\ 0 \le t_1, t_2, t_3 \le 1,\ t_1 + t_2 + t_3 \le 1\}.$$

The preceding argument shows that $T_1$ is primitive if and only if the tetrahedron

$$T_2 = \{u + t_1 v_1 + t_2 v_2 + t_3 v_3 : u, v_1, v_2, v_3 \in \mathbb{Z}^3,\ 0 \le t_1, t_2, t_3 \le 1,\ t_1 + t_2 + t_3 \ge 2\}$$

is primitive. But we cannot conclude that the parallelepiped

$$P = \{u + t_1 v_1 + t_2 v_2 + t_3 v_3 : u, v_1, v_2, v_3 \in \mathbb{Z}^3,\ 0 \le t_1, t_2, t_3 \le 1\}$$

is primitive since, in addition to $T_1$ and $T_2$, there are four other tetrahedra inside
P that need not be primitive. Therefore, we cannot find a bound for the volume of
P and use it to give a bound on the volume of $T_1$.
Recall that the volume of any lattice point tetrahedron is c/6, where c is a
positive integer. Suppose

$$T_1 = \{u + t_1 v_1 + t_2 v_2 + t_3 v_3 : u, v_1, v_2, v_3 \in \mathbb{Z}^3,\ 0 \le t_1, t_2, t_3 \le 1,\ t_1 + t_2 + t_3 \le 1\}$$

is a primitive tetrahedron and $\operatorname{vol}(T_1) = c/6$ with $c \ge 2$. The parallelepiped

$$P = \{u + t_1 v_1 + t_2 v_2 + t_3 v_3 : u, v_1, v_2, v_3 \in \mathbb{Z}^3,\ 0 \le t_1, t_2, t_3 \le 1\}$$

has volume c. By Theorem 2.2 there are $c - 1$ non-zero cosets in the quotient
group $\mathbb{Z}^3/(\mathbb{Z}v_1 \oplus \mathbb{Z}v_2 \oplus \mathbb{Z}v_3)$. It now follows that there are $c - 1$ lattice points in the
interior of P and each one represents a distinct coset of $\mathbb{Z}^3/(\mathbb{Z}v_1 \oplus \mathbb{Z}v_2 \oplus \mathbb{Z}v_3)$. This
assertion is based on the following two observations:


1. A coset has a unique representative in P if and only if the representative
lies in the interior of P.

2. There are no lattice points on the boundary of P other than the vertices.
Suppose $w = u + t_1 v_1 + t_2 v_2 + t_3 v_3$ is a lattice point lying on the boundary
of P that is not a vertex of P. There are two possible cases. Case 1: Exactly
1 member of the set $\{t_1, t_2, t_3\}$, say $t_3$, is an integer. In this case, either the
lattice point $u + t_1 v_1 + t_2 v_2$, or the lattice point $u + (1 - t_1)v_1 + (1 - t_2)v_2$
lies on one of the faces of $T_1$. Since $t_1, t_2 \notin \mathbb{Z}$, neither lattice point is a
vertex of P. Case 2: Exactly 2 members of the set $\{t_1, t_2, t_3\}$, say $t_2$ and $t_3$,
are integers. In this case, the lattice point $u + t_1 v_1$ lies on one of the edges of
$T_1$. Since $t_1 \notin \mathbb{Z}$, $u + t_1 v_1$ is not a vertex of $T_1$. Therefore, if P contains a
lattice point on its boundary that is not a vertex, then $T_1$ is not primitive.


Thus there are $c - 1$ lattice points in the interior of P. How are these $c - 1$
lattice points arranged inside P? The answer to this question gives us a geometric
characterization of primitive tetrahedra in $\mathbb{Z}^3$.


Theorem 3.2. Consider the tetrahedron

$$T = \{u + t_1 v_1 + t_2 v_2 + t_3 v_3 : u, v_1, v_2, v_3 \in \mathbb{Z}^3,\ 0 \le t_1, t_2, t_3 \le 1,\ t_1 + t_2 + t_3 \le 1\}$$

and the parallelepiped

$$P = \{u + t_1 v_1 + t_2 v_2 + t_3 v_3 : u, v_1, v_2, v_3 \in \mathbb{Z}^3,\ 0 \le t_1, t_2, t_3 \le 1\}.$$

Then T is primitive if and only if all the non-vertex lattice points in P lie in the interior
of one of the three diagonal parallelograms of P that do not intersect the interior of T,
i.e., one of the parallelograms with vertices $\{u + v_1, u + v_2, u + v_1 + v_3, u + v_2 + v_3\}$,
$\{u + v_2, u + v_3, u + v_1 + v_2, u + v_1 + v_3\}$, or $\{u + v_3, u + v_1, u + v_2 + v_3, u + v_1 + v_2\}$.


The sufficiency of the stated criterion is trivial and all the work lies in proving 
its necessity. We have been unable to construct a direct geometric proof and we 
pose this as a problem to our readers. We prove Theorem 3.2 by proving an 
analytic version, Theorem 4.2. 
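
The criterion of Theorem 3.2 can at least be explored numerically. The sketch below is an illustration on one convenient family, not a proof: it takes u = 0, $v_1 = (1, 0, 0)$, $v_2 = (0, 1, 0)$, $v_3 = (1, b, c)$ with integers $1 \le b$ and $1 \le c$ (these choices and all function names are our assumptions), lists the non-vertex lattice points of P in the coordinates $(t_1, t_2, t_3)$, and tests whether they all lie strictly inside a single diagonal parallelogram, i.e., on one of the planes $t_i + t_j = 1$ with $0 < t_i < 1$ and $0 < t_k < 1$.

```python
from fractions import Fraction
from math import gcd


def nonvertex_lattice_coords(b, c):
    """(t1, t2, t3) of the non-vertex lattice points of P for v3 = (1, b, c)."""
    coords = []
    for x in range(0, 3):              # x = t1 + t3 lies in [0, 2]
        for y in range(0, b + 2):      # y = t2 + b*t3 lies in [0, b + 1]
            for z in range(0, c + 1):  # z = c*t3 lies in [0, c]
                t3 = Fraction(z, c)
                t = (x - t3, y - b * t3, t3)
                in_p = all(0 <= ti <= 1 for ti in t)
                is_vertex = all(ti in (0, 1) for ti in t)
                if in_p and not is_vertex:
                    coords.append(t)
    return coords


def satisfies_criterion(b, c):
    """True if all non-vertex lattice points lie inside one diagonal parallelogram."""
    points = nonvertex_lattice_coords(b, c)

    def inside(t, i, j, k):
        return t[i] + t[j] == 1 and 0 < t[i] < 1 and 0 < t[k] < 1

    planes = [(0, 1, 2), (1, 2, 0), (2, 0, 1)]
    return any(all(inside(t, *plane) for t in points) for plane in planes)


if __name__ == "__main__":
    for b, c in [(1, 5), (2, 5), (2, 4), (3, 9), (5, 13)]:
        print(b, c, gcd(b, c), satisfies_criterion(b, c))
```

For this family the test should succeed exactly when gcd(b, c) = 1, which is the condition that appears in the analytic version of the theorem.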


4. UNIMODULAR MAPS. Before we can describe an analytic version of Theorem 
3.2, we need to define the concept of a unimodular map. 


Definition 4.1. The map $f: \mathbb{R}^n \to \mathbb{R}^n$ is a unimodular map if (i) f is affine, (ii) f
preserves volume, and (iii) f maps points in $\mathbb{Z}^n$ to points in $\mathbb{Z}^n$.

We leave it to the reader to check the following properties of unimodular maps.


Theorem 4.1. Let $f: \mathbb{R}^n \to \mathbb{R}^n$ be a map and let P be a lattice polytope in $\mathbb{Z}^n$.


(i) f is unimodular if and only if $f(u) = Mu + v$, where $M \in GL_n(\mathbb{Z})$, i.e., M is an
$n \times n$ matrix with integer entries and $\det(M) = \pm 1$, and $v \in \mathbb{Z}^n$.

(ii) If f is unimodular, then f is invertible and $f^{-1}$ is unimodular.

(iii) If f is unimodular then f(P) and $f^{-1}(P)$ are lattice polytopes. Furthermore, f
maps interior points of P to interior points of f(P), boundary points of P to
boundary points of f(P), and vertices of P to vertices of f(P).

(iv) If P is primitive, then both f(P) and $f^{-1}(P)$ are primitive.


Unimodular maps define an equivalence relation on the set of lattice polytopes 
and so we can consider equivalence classes of lattice polytopes. 


Definition 4.2. Two lattice polytopes $P_1, P_2$ in $\mathbb{R}^n$ are unimodularly equivalent if
there is a unimodular map $f: \mathbb{R}^n \to \mathbb{R}^n$ such that f maps $P_1$ to $P_2$. If $P_1, P_2$ are
unimodularly equivalent we write $P_1 \equiv P_2$.


Any two primitive triangles in $\mathbb{Z}^2$ are unimodularly equivalent. One way to
prove this is to apply Theorem 2.2. Suppose the triangles

$$T_1 = \{u_1 + t_1 v_1 + t_2 v_2 : u_1, v_1, v_2 \in \mathbb{Z}^2,\ 0 \le t_1, t_2 \le 1,\ t_1 + t_2 \le 1\}$$

and

$$T_2 = \{u_2 + t_1 w_1 + t_2 w_2 : u_2, w_1, w_2 \in \mathbb{Z}^2,\ 0 \le t_1, t_2 \le 1,\ t_1 + t_2 \le 1\}$$

are primitive. Then the associated parallelograms

$$P_1 = \{u_1 + t_1 v_1 + t_2 v_2 : u_1, v_1, v_2 \in \mathbb{Z}^2,\ 0 \le t_1, t_2 \le 1\}$$

and

$$P_2 = \{u_2 + t_1 w_1 + t_2 w_2 : u_2, w_1, w_2 \in \mathbb{Z}^2,\ 0 \le t_1, t_2 \le 1\}$$

are primitive. Let A be the $2 \times 2$ matrix whose first column is $v_1$ and whose
second column is $v_2$; and let B be the $2 \times 2$ matrix whose first column is $w_1$ and
whose second column is $w_2$. Since $P_1$ and $P_2$ are primitive parallelograms,
$A, B \in GL_2(\mathbb{Z})$. The unimodular map $f(x) = BA^{-1}x + (u_2 - BA^{-1}u_1)$ maps $T_1$ to
$T_2$ (and $P_1$ to $P_2$). This argument can be used to show that any two primitive
parallelepipeds in $\mathbb{Z}^n$ are unimodularly equivalent.
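
The construction of f is concrete enough to carry out by machine. The following Python sketch builds $M = BA^{-1}$ and the translation $u_2 - Mu_1$ for two primitive triangles given by a base vertex and two edge vectors; the function names and the sample triangles are our own choices, and the inverse is computed from the adjugate, which stays in integers precisely because $\det A = \pm 1$.

```python
def det2(M):
    return M[0][0] * M[1][1] - M[0][1] * M[1][0]


def inv2(M):
    """Inverse of a 2x2 integer matrix with determinant +1 or -1."""
    d = det2(M)
    if d not in (1, -1):
        raise ValueError("matrix is not in GL_2(Z)")
    # adjugate / d, and 1/d == d when d = +-1
    return [[d * M[1][1], -d * M[0][1]], [-d * M[1][0], d * M[0][0]]]


def matmul2(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def matvec2(M, x):
    return tuple(M[i][0] * x[0] + M[i][1] * x[1] for i in range(2))


def triangle_map(u1, v1, v2, u2, w1, w2):
    """Return (M, shift, f) with f(x) = M x + shift sending T1 onto T2."""
    A = [[v1[0], v2[0]], [v1[1], v2[1]]]  # columns v1, v2
    B = [[w1[0], w2[0]], [w1[1], w2[1]]]  # columns w1, w2
    M = matmul2(B, inv2(A))
    Mu1 = matvec2(M, u1)
    shift = (u2[0] - Mu1[0], u2[1] - Mu1[1])

    def f(x):
        y = matvec2(M, x)
        return (y[0] + shift[0], y[1] + shift[1])

    return M, shift, f


if __name__ == "__main__":
    # T1 has vertices (1,1), (3,2), (2,2); T2 has vertices (2,3), (3,4), (4,6).
    M, shift, f = triangle_map((1, 1), (2, 1), (1, 1), (2, 3), (1, 1), (2, 3))
    print(M, shift, [f(p) for p in [(1, 1), (3, 2), (2, 2)]])
```

The same recipe works in $\mathbb{Z}^n$ once `inv2` is replaced by an exact integer inverse for matrices of determinant $\pm 1$.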

We now describe an analytic version of Theorem 3.2, which appears as Corollary
5.7 in [10]. From here on, $T_{a,b,c}$ denotes the tetrahedron with vertices (0, 0, 0),
(1, 0, 0), (0, 1, 0), and (a, b, c), with $(a, b, c) \in \mathbb{Z}^3$ and $c \ne 0$.


Theorem 4.2 [Reeve-White-Howe-Scarf-Reznick]. A tetrahedron T is primitive if and
only if $T \equiv T_{0,0,1}$ or $T \equiv T_{1,b,c}$ with $1 \le b < c$ and $\gcd(b, c) = 1$.


We refer the reader to [10] for a proof of Theorem 4.2, and to [11] and [13] for 
proofs of variants of it. It is surprising that such an elegant and simple theorem is 
not better known. 


5. THE COUNTING FORMULA. Unimodular maps preserve volume; consequently,
the number of distinct (unimodular) equivalence classes of primitive
tetrahedra is infinite. We are led to consider the following question.


1999] A COUNTING FORMULA FOR PRIMITIVE TETRAHEDRA IN $\mathbb{Z}^3$ 529


For a given positive integer c, is there a counting formula for the number of
distinct (unimodular) equivalence classes of primitive tetrahedra of volume c/6?
The answer lies in determining the relationship between the integers x and b
when $T_{1,x,c} \equiv T_{1,b,c}$. The relationship is described in the following variant of
Theorem 5.6 in [10].


Theorem 5.1. Let $b, b^{-1}, c, x \in \mathbb{Z}$ with $1 \le x, b, b^{-1} < c$, $\gcd(b, c) = \gcd(x, c) = 1$,
and $bb^{-1} \equiv 1 \pmod c$. The primitive tetrahedra $T_{1,b,c}$ and $T_{1,x,c}$ are unimodularly
equivalent if and only if $x \in \{b,\ c - b,\ b^{-1},\ c - b^{-1}\}$.

Proof: (⇒) We can view a unimodular map from $T_{1,b,c}$ to $T_{1,x,c}$ as a map between
the vertices of $T_{1,b,c}$ and $T_{1,x,c}$. There are 24 possible maps. An examination of
each of these shows that $x \in \{b,\ c - b,\ b^{-1},\ c - b^{-1}\}$.

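For example, suppose we look at the unimodular map

$$(0, 0, 0) \mapsto (0, 1, 0), \quad (1, 0, 0) \mapsto (1, x, c), \quad (0, 1, 0) \mapsto (0, 0, 0), \quad (1, b, c) \mapsto (1, 0, 0).$$

This map has the form

$$\begin{pmatrix} u_1 \\ u_2 \\ u_3 \end{pmatrix} \mapsto \begin{pmatrix} 1 & 0 & 0 \\ x - 1 & -1 & a \\ c & 0 & -1 \end{pmatrix} \begin{pmatrix} u_1 \\ u_2 \\ u_3 \end{pmatrix} + \begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix}$$

with the conditions that $a \in \mathbb{Z}$ and $x - b + ac = 0$. Since $|x - b| < c$, it follows
that a = 0 and therefore x = b.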
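(⇐) $T_{1,b,c} \equiv T_{1,c-b,c}$ via the unimodular map

$$\begin{pmatrix} u_1 \\ u_2 \\ u_3 \end{pmatrix} \mapsto \begin{pmatrix} 1 & 0 & 0 \\ -1 & -1 & 1 \\ 0 & 0 & 1 \end{pmatrix} \begin{pmatrix} u_1 \\ u_2 \\ u_3 \end{pmatrix} + \begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix},$$

$T_{1,b,c} \equiv T_{1,b^{-1},c}$ via the unimodular map

$$\begin{pmatrix} u_1 \\ u_2 \\ u_3 \end{pmatrix} \mapsto \begin{pmatrix} -1 & 0 & 0 \\ 0 & b^{-1} & \dfrac{1 - bb^{-1}}{c} \\ 0 & c & -b \end{pmatrix} \begin{pmatrix} u_1 \\ u_2 \\ u_3 \end{pmatrix} + \begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix},$$

and $T_{1,b,c} \equiv T_{1,c-b^{-1},c}$ via the unimodular map

$$\begin{pmatrix} u_1 \\ u_2 \\ u_3 \end{pmatrix} \mapsto \begin{pmatrix} -1 & 0 & 0 \\ 1 & c - b^{-1} & -\dfrac{b(c - b^{-1}) + 1}{c} \\ 0 & c & -b \end{pmatrix} \begin{pmatrix} u_1 \\ u_2 \\ u_3 \end{pmatrix} + \begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix}. \qquad ∎$$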
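
Each of these three maps can be verified directly: the matrix must have integer entries and determinant $\pm 1$, and the four vertices of $T_{1,b,c}$ must be carried onto the four vertices of the target tetrahedron. The Python sketch below performs that check; the function names are ours, the matrices are our reading of the displayed maps, and `pow(b, -1, c)` (available from Python 3.8) is used for $b^{-1}$ modulo c.

```python
from math import gcd


def det3(M):
    (a, b, c), (d, e, f), (g, h, i) = M
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def apply_map(M, shift, u):
    return tuple(sum(M[r][k] * u[k] for k in range(3)) + shift[r] for r in range(3))


def vertices(b, c):
    """Vertex set of the tetrahedron T_{1,b,c}."""
    return {(0, 0, 0), (1, 0, 0), (0, 1, 0), (1, b, c)}


def equivalence_maps(b, c):
    """Triples (x, M, shift) intended to map T_{1,b,c} onto T_{1,x,c}."""
    b_inv = pow(b, -1, c)
    return [
        (c - b, [[1, 0, 0], [-1, -1, 1], [0, 0, 1]], (0, 1, 0)),
        (b_inv,
         [[-1, 0, 0], [0, b_inv, (1 - b * b_inv) // c], [0, c, -b]],
         (1, 0, 0)),
        (c - b_inv,
         [[-1, 0, 0], [1, c - b_inv, -((b * (c - b_inv) + 1) // c)], [0, c, -b]],
         (1, 0, 0)),
    ]


def check(b, c):
    """True if all three maps are unimodular and send vertices to vertices."""
    for x, M, shift in equivalence_maps(b, c):
        if abs(det3(M)) != 1:
            return False
        if {apply_map(M, shift, u) for u in vertices(b, c)} != vertices(x, c):
            return False
    return True


if __name__ == "__main__":
    print(all(check(b, c) for c in range(2, 40)
              for b in range(1, c) if gcd(b, c) == 1))
```

The integer divisions are exact because $bb^{-1} \equiv 1 \pmod c$, so no rounding is involved.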


Before proceeding to the statement of the counting formula, it is instructive to
determine the number of equivalence classes of primitive tetrahedra of volume
c/6 for some specific values of c. We do so for c = 13 and c = 14. We start with
the case c = 13. Because of Theorem 4.2 we need consider only the 12 primitive
tetrahedra $T_{1,1,13}, T_{1,2,13}, T_{1,3,13}, \ldots, T_{1,12,13}$ and determine which are unimodularly
equivalent. Theorem 5.1 ensures that $T_{1,1,13} \equiv T_{1,12,13}$,
$T_{1,2,13} \equiv T_{1,6,13} \equiv T_{1,7,13} \equiv T_{1,11,13}$, $T_{1,3,13} \equiv T_{1,4,13} \equiv T_{1,9,13} \equiv T_{1,10,13}$,
and $T_{1,5,13} \equiv T_{1,8,13}$, so there
are 4 equivalence classes of primitive tetrahedra of volume 13/6. For c = 14, we
need to consider only the 6 primitive tetrahedra $T_{1,1,14}$, $T_{1,3,14}$, $T_{1,5,14}$, $T_{1,9,14}$,
$T_{1,11,14}$, and $T_{1,13,14}$. Since $T_{1,1,14} \equiv T_{1,13,14}$ and $T_{1,3,14} \equiv T_{1,5,14} \equiv T_{1,9,14} \equiv T_{1,11,14}$,
there are 2 equivalence classes of primitive tetrahedra of volume 14/6.
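
These two hand computations follow a fixed recipe, so they are easy to automate. The sketch below (the function name is ours) groups the values of b with gcd(b, c) = 1 into the sets $\{b,\ c - b,\ b^{-1},\ c - b^{-1}\}$ supplied by Theorem 5.1; each set corresponds to one equivalence class of tetrahedra $T_{1,b,c}$. It assumes $c \ge 2$ and Python 3.8 or later for the modular inverse `pow(b, -1, c)`.

```python
from math import gcd


def equivalence_classes(c):
    """Partition {b : 1 <= b < c, gcd(b, c) = 1} into classes {b, c-b, b^-1, c-b^-1}."""
    seen = set()
    classes = []
    for b in range(1, c):
        if gcd(b, c) != 1 or b in seen:
            continue
        b_inv = pow(b, -1, c)
        orbit = sorted({b, c - b, b_inv, c - b_inv})
        classes.append(orbit)
        seen.update(orbit)
    return classes


if __name__ == "__main__":
    for c in (13, 14):
        print(c, equivalence_classes(c))
```

For c = 13 and c = 14 the lists produced should match the groupings worked out above.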

We now prove the counting formula by combining Theorem 5.1 with Burnside’s 
lemma. 


Theorem 5.2 (Burnside’s lemma). Let a finite group G act on a finite set X and have
N orbits. Then

$$N = \frac{1}{|G|} \sum_{g \in G} |X_g|,$$

where $X_g = \{x : g(x) = x,\ x \in X\}$, i.e., $X_g$ is the set of fixed points of g.


See [12, Theorem 2.71, p. 125] for a proof, as well as an explanation of why it is 
referred to by some as not-Burnside’s lemma. 
We are now ready to state and prove the counting formula. 


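Theorem 5.3. Suppose n is an integer greater than 2, T(n) is the number of distinct
equivalence classes of primitive tetrahedra of volume n/6, $f(x) \in (\mathbb{Z}/n\mathbb{Z})[x]$, and
$N(f(x) \equiv 0 \pmod n)$ denotes the number of solutions of the congruence $f(x) \equiv 0 \pmod n$
in $\mathbb{Z}/n\mathbb{Z}$. Then

$$T(n) = \frac{\phi(n) + N(x^2 - 1 \equiv 0 \pmod n) + N(x^2 + 1 \equiv 0 \pmod n)}{4}.$$

In particular, if n is a prime p, then

$$T(p) = \left\lfloor \frac{p}{4} \right\rfloor + 1 = \begin{cases} \dfrac{p + 3}{4} & \text{if } p \equiv 1 \pmod 4, \\[1ex] \dfrac{p + 1}{4} & \text{if } p \equiv 3 \pmod 4. \end{cases}$$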


Proof: Let $U = \{x : 1 \le x < n,\ \gcd(x, n) = 1\}$ be the set of units of $\mathbb{Z}/n\mathbb{Z}$ and let
$G = \{g_1, g_2, g_3, g_4\}$ be the group of bijections of U, where $g_1(x) = x$, $g_2(x) = n - x$,
$g_3(x) = x^{-1}$, and $g_4(x) = n - x^{-1}$ for any element $x \in U$. The cardinality
of U is $\phi(n)$, the Euler phi function evaluated at n, and $G \cong \mathbb{Z}/2\mathbb{Z} \oplus \mathbb{Z}/2\mathbb{Z}$.


Theorem 5.1 says that T(n) equals the number of orbits of U under the action
of G. Let $U_i = \{x : g_i(x) = x,\ x \in U\}$, i = 1, 2, 3, 4. Since $g_1$ is the identity it fixes
all of U. When n > 2, $g_2$ has no fixed points, because $x = n - x$ forces $x = n/2$, which
is not a unit. The fixed points of $g_3$ are the
solutions of the congruence $x^2 \equiv 1 \pmod n$ and the fixed points of $g_4$ are the
solutions of the congruence $x^2 \equiv -1 \pmod n$. Thus,


$$|U_1| = \phi(n), \quad |U_2| = 0, \quad |U_3| = N(x^2 - 1 \equiv 0 \pmod n), \quad |U_4| = N(x^2 + 1 \equiv 0 \pmod n).$$

We now apply Burnside’s lemma to obtain the formula

$$T(n) = \frac{\phi(n) + N(x^2 - 1 \equiv 0 \pmod n) + N(x^2 + 1 \equiv 0 \pmod n)}{4}. \qquad ∎$$
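
The Burnside count in this proof can be reproduced by applying the four maps $g_1, \ldots, g_4$ to every unit and counting fixed points. The following Python sketch does exactly that; the function names are ours, the fixed points are found by direct enumeration rather than by any closed formula, and `pow(x, -1, n)` requires Python 3.8 or later.

```python
from math import gcd


def fixed_point_counts(n):
    """Return [|U_1|, |U_2|, |U_3|, |U_4|] for the maps g_1, ..., g_4 on the units mod n."""
    units = [x for x in range(1, n) if gcd(x, n) == 1]
    maps = [
        lambda x: x,                  # g_1
        lambda x: n - x,              # g_2
        lambda x: pow(x, -1, n),      # g_3
        lambda x: n - pow(x, -1, n),  # g_4
    ]
    return [sum(1 for x in units if g(x) == x) for g in maps]


def count_classes(n):
    """T(n) computed as (|U_1| + |U_2| + |U_3| + |U_4|) / 4, for n > 2."""
    total, remainder = divmod(sum(fixed_point_counts(n)), 4)
    if remainder:
        raise ArithmeticError("fixed-point total is not divisible by 4")
    return total


if __name__ == "__main__":
    for n in (13, 14, 17, 19, 30):
        print(n, fixed_point_counts(n), count_classes(n))
```

For n = 13 and n = 14 the result should agree with the class counts found by hand earlier in this section, and for an odd prime p it should equal $\lfloor p/4 \rfloor + 1$.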


Theorem 4.2 shows that if T is a primitive tetrahedron of volume 1/6 then
$T \equiv T_{0,0,1}$, and if T is a primitive tetrahedron of volume 1/3 then $T \equiv T_{1,1,2}$; so in
the exceptional cases when n = 1, 2 we have T(n) = 1. The reader can observe
that $\phi(n)$, $N(x^2 - 1 \equiv 0 \pmod n)$, and $N(x^2 + 1 \equiv 0 \pmod n)$ are multiplicative
functions of n. We now cite a result from the theory of quadratic residues that
gives a formula for $N(x^2 - 1 \equiv 0 \pmod n)$ and $N(x^2 + 1 \equiv 0 \pmod n)$.


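Theorem 5.4. Let $n = 2^k m$, with n > 2 and gcd(m, 2) = 1. If m > 1, let the prime
divisors of m be $p_1, \ldots, p_t$; otherwise set t = 0. Then

$$N(x^2 - 1 \equiv 0 \pmod n) = \begin{cases} 2^t & \text{if } k = 0, 1, \\ 2^{t+1} & \text{if } k = 2, \\ 2^{t+2} & \text{if } k \ge 3, \end{cases}$$

and

$$N(x^2 + 1 \equiv 0 \pmod n) = \begin{cases} 2^t & \text{if } k \le 1 \text{ and } p_i \equiv 1 \pmod 4 \text{ for all } i \in \{1, \ldots, t\}, \\ 0 & \text{if } k \ge 2 \text{ or there exists an } i \in \{1, \ldots, t\} \text{ such that } p_i \equiv 3 \pmod 4. \end{cases}$$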


See [5, Chapter 2, Section 8; Chapter 3, Section 5] for a proof. 
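
The closed forms of Theorem 5.4 can be compared against a direct count of solutions. In the sketch below (all function names are our own) the factorization uses plain trial division, which is adequate for small n, and the brute-force counter simply tests every residue class.

```python
def split_power_of_two(n):
    """Write n = 2**k * m with m odd and return (k, m)."""
    k = 0
    while n % 2 == 0:
        n //= 2
        k += 1
    return k, n


def odd_prime_divisors(m):
    """Distinct prime divisors of an odd positive integer m, by trial division."""
    primes = []
    p = 3
    while p * p <= m:
        if m % p == 0:
            primes.append(p)
            while m % p == 0:
                m //= p
        p += 2
    if m > 1:
        primes.append(m)
    return primes


def roots_of_one(n):
    """Closed form for N(x^2 - 1 = 0 mod n) as stated in Theorem 5.4."""
    k, m = split_power_of_two(n)
    t = len(odd_prime_divisors(m))
    if k <= 1:
        return 2 ** t
    return 2 ** (t + 1) if k == 2 else 2 ** (t + 2)


def roots_of_minus_one(n):
    """Closed form for N(x^2 + 1 = 0 mod n) as stated in Theorem 5.4."""
    k, m = split_power_of_two(n)
    primes = odd_prime_divisors(m)
    if k >= 2 or any(p % 4 == 3 for p in primes):
        return 0
    return 2 ** len(primes)


def brute_force(n, constant):
    """Number of x in Z/nZ with x^2 + constant = 0 (mod n)."""
    return sum(1 for x in range(n) if (x * x + constant) % n == 0)


if __name__ == "__main__":
    for n in (5, 8, 12, 24, 50, 65):
        print(n, roots_of_one(n), brute_force(n, -1),
              roots_of_minus_one(n), brute_force(n, 1))
```

Combined with $\phi(n)$, these two functions give T(n) without enumerating orbits, which is the practical payoff of the theorem.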


6. HOWE’S THEOREM. We now describe Howe’s theorem on primitive polyhedra.
Theorem 2.1 shows that any primitive polyhedron in $\mathbb{R}^3$ has at most 8 vertices.
Howe proved that any primitive polyhedron with eight vertices is, up to a
unimodular map, the convex hull of a square and a parallelogram.


Theorem 6.1 [Howe]. Let P be a primitive polyhedron with eight vertices. Then there is
a unimodular map that maps P to the polyhedron whose vertices are (0, 0, 0), (1, 0, 0),
(0, 1, 0), (0, 0, 1), (0, 1, 1), (1, a, b), (1, c, d), and (1, a + c, b + d) with $a, b, c, d \in \mathbb{Z}$,
$a, b, c, d \ge 0$ and $ad - bc = 1$. Furthermore, any primitive polyhedron with fewer
than eight vertices can be embedded in one with eight vertices.


We refer the reader to the article by Scarf [13] for a proof of this theorem. It
should be noted that the four vertices (0, 0, 0), (0, 1, 0), (0, 0, 1), and (0, 1, 1) lie on
the plane x = 0 and form a square of area 1. The other four vertices (1, 0, 0),
(1, a, b), (1, c, d), and (1, a + c, b + d) lie on the parallel plane x = 1 and form a
parallelogram of area 1.

The problem of characterizing primitive tetrahedra was independently studied 
in 1957 by Reeve [8] and in 1964 by White [15]; Theorem 4.2 arises from combining 
their results. In 1977, Howe independently discovered Theorem 4.2 and its 
generalization, Theorem 6.1. He did not publish his work and it was Scarf [13] who 
first publicized Howe’s theorem. Over the years, other mathematicians have 
rediscovered Theorem 4.2. For example, Therese Hart, Karen Rogers, and I
discovered it in 1991 and wrote up our results in the unpublished manuscript [4]. 
We then came across Reznick’s [10] article where we learnt of the work of Reeve, 
White, and others. The contents of [4] are included in Chapter 2 of K. Rogers’ 
doctoral dissertation [11]. The last chapter of this dissertation contains some 
partial results on primitive simplices in $\mathbb{Z}^4$.

We leave the reader with the following question: Are there analogues of the 
counting formula for primitive polyhedra with 5, 6, 7, and 8 vertices? 
What Makes a Great Mathematics Teacher?
The Case of Augustus De Morgan


Adrian Rice 


1. INTRODUCTION. It is often said that no one forgets a good teacher. Whether 
this statement is true or not, almost everyone can recall at least one teacher who 
influenced some aspect of his or her future study or career. But there are 
occasional examples of great mathematics teachers who instill a remarkable 
number of their students with a love and enthusiasm for the subject, which has a 
lasting and profound effect on them, even if they never become practising
mathematicians.

The nineteenth century British mathematician Augustus De Morgan (1806-1871) 
was one such teacher. Although his name is well known to any student of set 
theory, his chief mathematical legacy arose from his novel research in logic. This 
research created the first logic of relations, and promoted a symbolic approach to 
the subject, in which regard he greatly encouraged the work of his friend and 
contemporary George Boole. [34] De Morgan was also interested in algebra [26], 
and his attempts to extend the geometrical representation of complex numbers 
influenced the discovery of quaternions by his friend William Rowan Hamilton. 
Mathematical analysis, a subject very much neglected in early nineteenth-century 
Britain, also occupied much of De Morgan’s attention, and he produced notable 
work on convergence of series. [20, pp. 148-9] He also published many research 
papers on various aspects of the history of mathematics, about which he was a rare 
authority at the time. [29], [31] 

By the end of his career in the mid-1860s, De Morgan was one of the most 
influential and highly-regarded mathematicians in Britain, outliving Hamilton and 
Boole by several years and out-ranking the likes of Cayley and Sylvester
(themselves far more original mathematicians) by virtue of his age. But how was this
reputation achieved? Why was he so highly regarded? 

One obvious reason is curiously often the most overlooked. For virtually his 
entire career, De Morgan was professor of mathematics at University College 
London, a radically innovative establishment, which, at its foundation in 1826, was 
the first university-level institution to be established in England since Oxford and 
Cambridge in the Middle Ages. There, he single-handedly delivered courses on 
mathematics to a generation of undergraduates for a third of a century. Because 
he was in charge of mathematical tuition at the leading higher educational
institution in his nation’s capital, he was a formative influence on numerous
mathematicians, scientists, and other prominent intellectual figures of the
Victorian period.

Unfortunately, there is a pronounced absence of published material pertaining 
to De Morgan’s work as a teacher. We know that he wrote a series of very 
successful textbooks, and these are useful to some extent; but they shed little light 
on what he actually taught in his lecture room. 

However, there is a major unpublished source of information about De Morgan’s 
teaching. This source was described in the Encyclopaedia Britannica as “a large mass
of mathematical tracts which he prepared for the use of his students, treating all
parts of mathematical science, and embodying some of the matter of his lectures”.
[24, p. 9] They are preserved in the University of London Library in the form of 
more than 320 notebooks containing the majority of De Morgan’s course material 
in his own handwriting. The contents of these manuscripts give us a considerable 
insight into the material covered by mathematics students at the most progressive 
educational establishment in nineteenth-century England. 

In this article, we first investigate the content of De Morgan’s mathematical 
course and examine his teaching methodology; we then consider evidence from his 
students about how this material came across. Finally, by comparing his teaching 
style with that of other well known nineteenth-century mathematicians, we
highlight some of the ingredients that go into making a great mathematics teacher, and
show that these criteria are satisfied in the case of Augustus De Morgan. 


2. DE MORGAN’S PROFESSORIAL CAREER. Following his birth in India in 
June 1806, De Morgan spent the majority of his formative years in southwest 
England, where he received an adequate classical education. In 1823, aged only 
sixteen, he entered Trinity College, Cambridge, where his mathematical talents 
were nurtured by his tutors, who included the prominent English mathematician 
George Peacock. Peacock, Charles Babbage, and John Herschel had founded the 
short-lived but influential Analytical Society in 1812, [17] which helped secure the 
adoption of Lagrange’s algebraic methods of calculus in the Cambridge syllabus, 
replacing the Newtonian fluxional system, which had been entrenched in Britain 
for well over a century. [21] 

De Morgan’s Cambridge years coincided with the foundation of a university in 
London, the only capital city in Europe without such an institution at this time. 
Indeed, up to this period, Oxford and Cambridge were the only places in England 
to offer university qualifications, and since they were fully open only to members 
of the Church of England, denominations such as Catholics or Jews were
effectively barred from university degrees. So too were the urban middle classes who,
while not poor, were nevertheless financially incapable of supporting their
offspring through courses of study away from home. The establishment of University
College London (originally titled “London University”) in 1826 was a radical 
solution to this problem, made all the more so by its explicit secular character and 
progressive programme of studies. 

Equally daring was its choice of founding professor of mathematics. De Morgan 
was appointed to the position in February 1828, scarcely a year after his graduation 
from Cambridge, aged only twenty-one. [30] But not all went smoothly. After 
opening for lectures the following October, the new university was plagued by 
financial troubles and petty personal disputes. After a professorial colleague was 
dismissed in 1831, De Morgan immediately resigned on principle. Five years later, 
however, he was invited to return after the premature death of his replacement. 
He was to remain for a further thirty years. 

His final departure was occasioned by the college’s (non-)adherence to its policy 
of religious equality; indeed, he remains the only professor in the history of 
University College to have resigned twice on matters of principle. For De Morgan, 
the college’s refusal to appoint a candidate to the vacant chair of philosophy on the 
grounds of his being a controversial Unitarian minister was a betrayal of its 
founding principles. He resigned his professorship on 10 November 1866, giving his 
last lecture in the summer of 1867. He never returned, refusing even a request 
from his former students to sit for a bust to be placed in the college library,
explaining that, as far as he was concerned, “our old college no longer exists”.
[15, p. 360]


Figure 1. Augustus De Morgan pictured in 1866.


3. THE COURSE. During the period of De Morgan’s professorship, his mathematics
course formed a central component of the college’s curriculum. It was
intended to constitute part of the students’ first two years, during which time they
would also study such subjects as Latin, Greek, and natural philosophy (i.e.,
physics). Since school education was not yet compulsory and the school leaving age
was, on average, around fourteen, the students at University College in De
Morgan’s day were substantially younger than they are now. In general, they 
ranged from 15 to 18, usually leaving the college to begin vocational training, 
employment or, in the case of the exceptional students, more advanced study at 
Oxford (if they were classically inclined) or Cambridge (if mathematically). 
In mathematics the students were divided into classes corresponding to the first
and second years of undergraduate study, with each class being further divided 
into lower and higher divisions. The first year (or junior) course was designed to 
contain “what is most essential for those who are intended for practical
professions, such as Civil Engineers, &c.”, [33, p. 42] while the senior class was intended
for those capable of tackling more advanced topics. However, due to the range of
materials available, the course was “confined principally to those parts of the
subject which are necessary for the study of Natural Philosophy”. [33, p. 42] The
following outline indicates that this definition was a very broad one: 


JUNIOR CLASS, LOWER DIVISION: 


i) Arithmetic and the arithmetical theory of proportion 
ii) Euclid, Books 1-4 
iii) 6th Book of Euclid 
iv) First book of Solid Geometry in Euclid 
v) Algebra, arithmetically considered, up to equations of the first degree. 


JUNIOR CLASS, HIGHER DIVISION: 


i) Euclid, Books 5 and 6 

ii) First book of Solid Geometry in Euclid 
iii) A review of the principles and operations of arithmetic 
iv) Algebra (including the nature and use of logarithms) 
v) Plane trigonometry (including mensuration). 


SENIOR CLASS, LOWER DIVISION: 


i) Spherical trigonometry 
ii) Conic sections 
iii) Applications of algebra to geometry 
iv) Higher parts of algebra 
v) Differential and integral calculus. 


SENIOR CLASS, HIGHER DIVISION: 


Extension of subjects in the Lower Senior Class. “Subjects which all must 
learn who wish to become analysts, whether for Engineering or any other 
pursuit.” [39, pp. 6-7], [40, pp. 7-8], [41, pp. 19, 35] 


However, De Morgan was at pains to point out that this plan should not be 
regarded as a definitive declaration of intentions. As he said in his inaugural 
lecture of 1828, “I shall not consider myself bound to carry the class through the 
whole of what is contained in it if it shall appear that their interest will be more 
effectually consulted by my confining myself to the more prominent parts of it.” 
[6, f. 45] As far as he was concerned, it was quality of knowledge that mattered 
more than quantity. 

In order to achieve this, De Morgan highlighted the two principal methods 
whereby his students could acquire mathematical knowledge: “The first is by 
diligent study in the retirement of the closet; the second, by haunting the benches 
of the lecture-room, and picking up what may chance to fall.” [13, p. 14] Lectures 
alone, he maintained, were insufficient to bring the student to the appropriate 
level of understanding. Moreover, student lecture notes, while important, were no 
substitute for a full treatise; indeed, he compared the information obtained from 
listening to a lecture to the comprehension achieved from reading a book at speed. 
De Morgan thus regarded the role of lectures as merely providing students with
assistance in difficulty and guidance on relevant reading. 

In order to enlarge this oral instruction, he prepared a vast quantity of 
handwritten tracts on all aspects of his course, which were then placed in the 
University College library for his students to refer to. They were designed to 
supplement not only the lecture material, but also the wide reading that De 
Morgan expected his students to undertake.¹ The surviving notebooks (written
between 1843 and 1866) are well over three hundred in number, each featuring De 
Morgan’s legible handwriting. De Morgan also had an idiosyncratic habit of pasting 
in printed material that he considered particularly relevant; these insertions often 
consisted of an appropriate paper, usually by himself. These tracts reveal that he 
taught much more than was indicated in the published syllabi and exam papers. 
What now follows is a survey of the material contained in the existing tracts to give 
some idea of what a student of mathematics at University College could have 
expected to study under De Morgan 150 years ago. 


TABLE 1. De Morgan’s surviving tracts

CLASS            NUMBER OF NOTEBOOKS
Lower Junior     10
Higher Junior
Lower Senior
Higher Senior

3.1 The Lower Junior Class. Of the 327 surviving notebooks, only ten contain 
material designed for the use of students in De Morgan’s lower junior class. Not 
only was the subject matter far less extensive than in succeeding classes, but also, 
in the mode of tuition adopted by the Professor for this class, oral lectures 
occupied a very small place, the majority of the time being devoted to giving 
written exercises and answering students’ questions. Furthermore, for much of the 
relevant material at this introductory level, existing textbooks were perfectly 
adequate, such as his own Elements of Arithmetic (1830) and Elements of Algebra 
(1835), as well as numerous editions of Euclid’s Elements. 

As well as giving alternative presentations of material that could be found in the 
students’ books, the tracts dealt in some considerable depth with matters with 
which most textbooks (even De Morgan’s) did not concern themselves. One of the 
most fascinating tracts (#110, comprising three notebooks) was designed to be
read before the student opened the first page of Euclid. Entitled “Notions
preliminary to Geometry”, it illustrates De Morgan’s desire for his students to be
acquainted from the very start with the philosophical and epistemological issues 
relating to the subject. More significantly, it demonstrates his belief that a 
thorough grounding in logical notions and processes was essential for the students’ 


¹In the absence of photocopiers, many would copy the contents of these tracts wholesale, as evinced 
by a large volume in University College Library containing a student’s transcription of 33 of them: 
University College London Archives, MS.ADD.6, “Mathematical Tracts by Professor De Morgan, 
copied from the original Manuscripts in the Library of University College London by John Power Hicks 
1849-1851”. 
understanding of geometrical arguments.² As he said: 


The principles on which geometrical propositions are established belong to 
the totally distinct and equally simple science of logic; and since geometry 
without logic would be absurd, it is desirable that the principles of the latter 
science should be studied with precision previously to employing them upon 
the former. [9, pp. 238-9] 


De Morgan was wont to complain about the lack of contact between the disciplines 
of mathematics and logic: “Geometers have seldom been very formal logicians; 
and their patent of exemption was signed by Euclid.” [7, p. 435] One of the 
principal sources of confusion when initiating students into the study of geometri- 
cal demonstrations was the distinction between a proposition and its converse. So, 
for example, the statement that ‘all equilateral triangles are equiangular’ was often 
taken to imply that ‘all equiangular triangles are equilateral’. “These errors,” said 
De Morgan, “should be guarded against beforehand, by exercising the pupil in 
simple deductions, such as are to be found in every syllogism, taking care that all 
terms used have reference to objects with which they are familiar. It should be 
illustrated to them that the truth of an argument depends on two distinct 
considerations, the truth of the premises, and the manner in which the conclusion 
is deduced from them.” [8, pp. 272-3] 

The next step before coming to actual geometrical demonstrations was to 
introduce his students to the concept of a proof. “A proposition”, he wrote, “may 
be proved in two ways: Directly, by showing that it is true. Indirectly, by showing 
that the contradiction is false.” [46, f. 8] Since the latter was conceptually the more 
difficult for the beginner, this tract was principally concerned with this mode of 
procedure, which “forces an absurd result out of the contradiction, and therefore 
forces the denial of the contradiction, or the affirmation of the proposition”. 
[46, f. 9] Proof by contradiction was, in De Morgan’s opinion, rendered far more 
intelligible by the early study of logic, and mastery of the technique was a vital skill 
to acquire before tackling Euclid. 


3.2 The Higher Junior Class. On entering De Morgan’s higher junior class, his 
students were expected to be fully familiar with Euclidean deductive reasoning up 
to the fourth book of the Elements. But the fifth book, introducing the complex 
ideas of ratios and proportion, often caused the most problems. De Morgan noted 
that, owing to its highly convoluted presentation, “it has been customary for 
mathematical students among us to read the Fifth Book of Euclid; frequently 
without understanding it”. [10, p. iii] For this reason, he substituted arithmetical 
notions of proportion for the traditional geometrical ones, a simplification 
that helped to make the subject far more intelligible to his junior students than if 
he had left them to study it unaided. 

Despite his almost instinctive mathematical abilities, De Morgan was fully 
aware of the need to eliminate as many barriers as possible to the beginner’s 


²His writings on geometrical education provide the first published evidence of De Morgan’s interest 
in logic, although at this point it was utilised purely as a pedagogic tool. He later elaborated his ideas in 
a short book for his students entitled First Notions of Logic (preparatory to the study of geometry), 
published in 1839. This was later incorporated as the first chapter of his Formal Logic in 1847, by which 
time his interest in logic had transcended its utility merely as an aid to geometry, and was manifesting 
itself in the publication of research papers concentrating more on the intrinsic nature of the subject 
itself. 
understanding of unfamiliar mathematical topics. A further aid to the students’ 
geometric cognition was his rejection of perspective drawings in favour of three- 
dimensional models. As he explained to the audience of his introductory lecture: 


Considerable obstacles generally present themselves to the beginner, in 
studying the elements of Solid Geometry, from the practice which has 
hitherto uniformly prevailed in this country, of never submitting to the eye of 
the student, the figures on whose properties he is reasoning, but of drawing 
perspective representations of them upon a plane. ...I hope that I shall 
never be obliged to have recourse to a perspective drawing of any figure 
whose parts are not in the same plane. [6, ff. 50-1] 


Having dwelt extensively on Euclidean-related matters, the higher junior class 
would have then turned their attention to a recapitulation of the rules and 
procedures of arithmetic before being initiated into abstract algebra. The lower 
junior class would have already practised linear equations, but these were treated 
more as features of a universal arithmetic than a general algebra. Now the class 
was ready to learn the distinction between the two: “In Arithmetic every symbol of 
magnitude... represents a number, and nothing but a number... Algebra employs 
the symbols, the language and the rules of arithmetic. But...no letter a has its full 
meaning described until we are told both its value and its sign.” [50, ff. 1, 4] 

De Morgan’s algebra was a long way from the highly-evolved structural algebra 
of today. Yet it was to play a significant role in the development of abstract 
algebra, building on earlier work by his friend and former tutor, George Peacock. 
[27] However, whereas in Peacock’s algebra the symbols were generally understood 
to represent numbers or operations, De Morgan would deliberately keep them 
abstract. “Thus,” he wrote, “addition is to be, for the present, a sound void of 
sense. It is a mode of combination represented by +; when + receives its 
meaning, so also will the word addition.” [14, p. 101] It was an area to which he 
was to devote much research, although, since it was too advanced for his junior 
classes, he deferred its discussion until his pupils had reached the senior level. 

Now that they were familiar with algebraic terminology and ideas, the students 
were ready to progress to the solution of quadratic, cubic, and higher order 
equations. It was at this stage that students would have first come across the 
binomial theorem, leading immediately to the study of series, both finite and 
infinite. This in turn led them to convergent and divergent series, resulting in their 
introduction to one of the most crucial mathematical concepts, recently reinstated 
in analysis: the limit. 

In Cambridge during the second decade of the nineteenth century, the Analyti- 
cal Society had been instrumental in replacing Newton’s fluxional calculus with the 
algebraic method of Lagrange, [16] thus rejecting a system based (albeit very 
dubiously) on the notion of a limit. This concept had been reformulated by Cauchy 
in the early 1820s, but was not immediately accepted in France or elsewhere. 
De Morgan’s Elements of Algebra was the first English work to contain a definition 
of the continuity of a mathematical function using limits. His subsequent treatise 
on The Differential and Integral Calculus (1842), the most comprehensive English 
work on the subject for over a generation, was entirely grounded on the concept 
of limits. 

But, he said, it was meaningless to ask what a limit was since “‘What is the 
limit’ is the same question as ‘What is the exact expression for that which does not 
admit of exact expression’.” [51, f. 2] What then was the point of introducing limits 
in the first place? “We introduce them because we cannot do without them, being 
all the time perfectly willing to do without them if any one will show us how.” 
[51, f. 3] Yet despite these reservations, De Morgan was nonetheless firmly 
convinced of their epistemological soundness and, as one of the first in Britain to 
publish and teach mathematics using limits, his work helped establish this concept 
as the basis of modern mathematical analysis. 

The use of limits also formed the basis of his introduction to logarithms as well 
as his teaching of trigonometrical analysis, where he employed limits to analyse the 
various properties of expressions such as $(\sin x)/x$ and $(1 - \cos x)/x^2$. His tracts on 
trigonometry for this class began with the usual problems of plane trigonometry 
such as finding values of angles and sides given certain information, progressing to 
questions involving multiple angles and inverse functions. 
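
For reference, the limiting values of the two expressions just mentioned can be written out as follows. These are the standard modern results, offered as a sketch; the source does not reproduce De Morgan's own derivation, and the half-angle identity $1 - \cos x = 2\sin^2(x/2)$ is used here only as one convenient route.

```latex
\[
\lim_{x \to 0} \frac{\sin x}{x} = 1,
\qquad
\lim_{x \to 0} \frac{1 - \cos x}{x^2}
  = \lim_{x \to 0} \frac{1}{2}\left(\frac{\sin(x/2)}{x/2}\right)^{2}
  = \frac{1}{2}.
\]
```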

These were the topics studied by De Morgan’s higher junior class, as specified 
by the published syllabus. However, the existing tracts reveal that students were 
also given instruction in other areas, the first being interest and annuities. Interest, 
both simple and compound, was covered by De Morgan in his Elements of 
Arithmetic [12, pp. 150-60], but his tracts extended this treatment to include the 
rudiments of actuarial mathematics, introducing the students to the complex 
calculation of annuities based on mortality tables. Permutations and combinations were 
also covered, leading directly to elementary problems in probability theory. Inter- 
estingly, for this subject, De Morgan relied on a popular algebra primer written by 
a former student, Isaac Todhunter, [35] who by the 1850s had become a successful 
textbook author. 

Additional geometrical topics were also begun in this class, albeit at a fairly 
introductory level, the first being algebraic geometry. At this stage, problems set 
primarily involved either tracing curves or finding the intersection of two lines by 
solving simultaneous equations. Their initiation into projective geometry reached a 
slightly higher stage, proceeding as far as Pascal’s and Brianchon’s theorems. These 
topics, and much more besides, would be repeated in full in his lower senior class. 


3.3 The Lower Senior Class. By the time they entered the lower division of De 
Morgan’s senior class, the majority of his students would have completed at least 
one year of mathematical study. That year would have provided the students with a 
study programme of considerable intensity. However, this pales in comparison with 
the level of material covered during the following year, as illustrated by the 
number of relevant notebooks still in existence: in comparison to the 80 such 
documents relating to the junior classes, there are no fewer than 247 notebooks 
concerning material covered by the two divisions of the senior class. 

According to published sources at least, the lower senior course began with an 
introduction to spherical trigonometry. Again, De Morgan’s tracts on this topic 
supplemented both his lectures and a book on the subject—in this case, a small 
textbook he had written in 1834. His tracts included further explanation and 
examples of various points, including statements and proofs of the standard 
formulae for spherical triangles, and problems such as finding areas, inscribing and 
circumscribing circles, and supplemental triangles. 

Compared to just five items on spherical trigonometry, the number of individual 
notebooks containing material relating to conic sections is well over twenty; 
moreover, De Morgan’s treatment often varies from tract to tract. To begin with, 
the conics would have been defined purely geometrically. De Morgan would then 
introduce the closely-related topic of projective geometry, although, judging from 
the higher junior tracts, his students would already have received some introduc- 
tion to the subject by this time. De Morgan’s justification was that “the method of 
projections establishes the more general and more difficult properties of the conic 
sections with greater ease than the ordinary methods”. [49, f. 1] His projective 
geometry largely consisted of an analysis of various properties and peculiarities of 
projective figures, such as collinearity and involution, with all demonstrations 
relying on neatly drawn diagrams and Euclidean-style proofs. 

Once the class had reached a certain level of proficiency in projective geometry, 
De Morgan would employ algebraic geometry to give alternative demonstrations of 
similar—and, in some cases, the same—results. Having already defined straight 
lines and circles algebraically in the higher junior class, he began this level with a 
discussion of the general second degree equation $ay^2 + bxy + cx^2 + dy + ex + f = 0$, 
and considered the curves generated by its different variations. In such a way, 
he was able to give yet another introduction to the conic sections, extending the 
treatment to include algebraic treatments of results originally proved using projec- 
tive geometry. 

At this stage the class would have reached a fairly advanced level of algebra; 
indeed, by this time, their algebraic exercises included multiplying and dividing 
polynomials, and solving cubics using Cardano’s and Ferrari’s methods. Among 
other algorithms taught by De Morgan in the theory of equations was Horner’s 
method,⁴ a procedure for approximating roots of equations with no exact solution. 
He later described his motivation for introducing this method, and the results his 
students obtained after applying it to the equation $x^3 - 2x = 5$: 


In 1831, Fourier’s posthumous work on equations [18, pp. 209-17] showed 33 
figures of solution, got with enormous labour. Thinking this is a good 
opportunity to illustrate the superiority of the method of W. G. Horner, not 
yet known in France, and not much known in England, I proposed to one of 
my classes, in 1841, to beat Fourier on this point, as a Christmas exercise. I 
received several answers, agreeing with each other, to 50 places of decimals. 
In 1848, I repeated the proposal, requesting that 50 places might be ex- 
ceeded: I obtained answers of 75, 65, 63, 58, 57, and 52 places. [5, p. 292] 
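
The digit-by-digit character of Horner's method can be sketched in modern terms. The Python below is an illustrative reconstruction of the Ruffini-Horner scheme, not a transcription of anything in De Morgan's tracts: the function names (`taylor_shift`, `horner_eval`, `horner_root_digits`) are hypothetical, and the sketch assumes that the polynomial is non-positive at the given integer part and increases through a single root in the following unit interval, as the cubic above does between 2 and 3.

```python
def taylor_shift(coeffs, d):
    """Coefficients of p(x + d), highest degree first, via repeated synthetic division."""
    c = list(coeffs)
    n = len(c) - 1
    for i in range(n):
        for j in range(1, n + 1 - i):
            c[j] += c[j - 1] * d
    return c


def horner_eval(coeffs, x):
    """Evaluate the polynomial at x by nested multiplication."""
    acc = 0
    for a in coeffs:
        acc = acc * x + a
    return acc


def horner_root_digits(coeffs, integer_part, n_digits):
    """Decimal digits of the root lying in [integer_part, integer_part + 1).

    Assumes p(integer_part) <= 0 and p increasing through one root there.
    """
    c = taylor_shift(coeffs, integer_part)  # move the origin to the integer part
    digits = []
    for _ in range(n_digits):
        # Multiply the roots by 10 so the next decimal digit becomes a units digit.
        c = [a * 10**i for i, a in enumerate(c)]
        # Largest digit at which the polynomial has not yet crossed zero.
        d = max(k for k in range(10) if horner_eval(c, k) <= 0)
        digits.append(d)
        c = taylor_shift(c, d)  # move the origin to the new approximation
    return digits


# x^3 - 2x - 5 = 0 is the equation x^3 - 2x = 5, with its real root between 2 and 3.
digits = horner_root_digits([1, 0, -2, -5], 2, 75)
print("2." + "".join(map(str, digits)))
```

Because Python integers are unbounded, every step is exact integer arithmetic, much as in the hand computation: the number of places obtained is limited only by the labour expended.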


It is here that we begin to detect a new feature in De Morgan’s teaching: a 
desire to acquaint the more advanced pupils with recent mathematical develop- 
ments. Tract #25, for example, contains material that, while ostensibly concerned 
with the theory of equations, would nowadays be considered as part of complex 
analysis, being straight from the pages of recent works by Cauchy and Argand. It 
also includes several new proofs of the existence of a root of every equation, 
including a paper by De Morgan on the subject, pasted in the back as usual, 
although his advice to students was: “Read Argand first, and then examine 
Cauchy’s”. [43, f. 17] 

The lower seniors would also have been presented with many of the latest 
results in modern analysis, especially in their study of infinite series. But it was 
only after they had been given a thorough grounding in algebraic and analytic 
operations, especially regarding the meaning and significance of limits, that they 
were initiated into the subject of the differential calculus. From the tracts, we can 


³Much of De Morgan’s treatment of conic sections in his tracts on algebraic geometry was taken 
directly from George Salmon’s Treatise on Conic Sections, Hodges and Smith, Dublin, 1847. 

⁴Named after William George Horner (1786-1837), a school-teacher from southwest England, due 
to a paper he published in 1819. 
be sure that a fair number of students encountered problems; a particular 
difficulty concerned the differential coefficient of a function—a problem still 
encountered by students today. 


A beginner sees $(1 + x^2)^3$, and remembering that $x^3$ gave $3x^2$, he writes 
down $3(1 + x^2)^2$. He ought to have written $3(1 + x^2)^2 \times 2x$. The truth is that 
he has correctly answered a question,—but not the question which was asked. 
[45, f. 17] 
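
In modern notation, the slip De Morgan describes is the omission of the inner derivative required by the chain rule. A brief worked version follows; the auxiliary symbol $u$ is introduced here for illustration and does not appear in the tract.

```latex
\[
u = 1 + x^2, \qquad
\frac{d}{dx}\,u^3 = 3u^2\,\frac{du}{dx}
  = 3(1 + x^2)^2 \times 2x
  = 6x(1 + x^2)^2 .
\]
```

The beginner's answer is the derivative of $u^3$ with respect to $u$ rather than $x$, which is the question "correctly answered" in De Morgan's remark.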


De Morgan’s initial teaching of integration proceeded no further than finding 
areas under curves. However, there is evidence that he began elementary instruc- 
tion on differential equations in this class, although this involved little more than 
defining basic notions such as the order of an equation, the integrating factor, and 
how to find general and singular solutions. Such an introduction would have been 
of little use to those who chose to end their mathematical studies at this point. But 
these final subjects were to provide a background for the detailed course of study 
reserved for students who proceeded to De Morgan’s higher senior class. 


3.4 The Higher Senior Class. Attending De Morgan’s lectures as far as his lower 
senior class would have enabled the average student to pass the B.A. examination 
at the University of London, as well as to move on to the study of natural 
philosophy in the college. However, for those exceptionally capable (and keen) 
students who perhaps wished to try for an M.A. degree, it was advisable to enter 
University College’s highest mathematical class. This course was obviously the most 
technically demanding and, although the class would never have been huge, was 
one to which De Morgan clearly devoted much time and attention. 

This is evinced by the 138 notebooks he wrote for this class, more than for any 
other division of his students. This high number of tracts is explained by the fact 
that fewer, if any, elementary textbooks were available on the topics of his higher 
senior lectures. For much of this section of the course, the most useful work would 
have been his Differential and Integral Calculus, since the subject dominated the 
material covered. Other areas were also treated, such as further theory of equa- 
tions, three-dimensional geometry, and probability theory, but their study was 
vastly outweighed by the amount of time devoted to calculus-related topics. 

Chief among these was the study of differential equations, briefly introduced in 
the lower senior class. As with all of De Morgan’s tracts on subjects of some 
complexity, those dealing with the first principles cover each aspect in careful 
detail. It is quite obvious from the sheer number of notebooks relating to the 
various types of equation (around thirty) that De Morgan was anxious that his 
students should obtain as much experience and practice of solving them as 
possible. He even wrote an entire tract containing model solutions to questions on 
the subject from University of London examination papers. The chief application 
of differential equations in De Morgan’s higher senior tracts was to the study of 
curves and surfaces, where the subject matter is almost entirely based on the 
differential geometry contained in Gauss’s Disquisitiones Generales circa Superficies 
Curvas of 1827. 

The class was also introduced to a second form of differentiation in order to 
facilitate the subsequent study of mechanics. This was the calculus of variations. 
Much of the material contained in the tracts is also presented in his Differential 
and Integral Calculus [11, pp. 446-75], such as the famous brachistochrone problem 
of finding “the curve of shortest descent from one curve to another, a heavy point 
descending upon the curve (supposed hard) by the action of gravity, with no 
velocity at the commencement”. [44, f. 4] But his treatment of the subject, while 
thorough, was not exhaustive; for example, he directed the more advanced stu- 
dents to “the Memoir of Poisson on the Calculus of Variations, in the twelfth 
volume of the Memoirs of the Institute”. [11, p. 454] 

In addition to the study of these ‘pure’ mathematical subjects, De Morgan also 
managed to include a few items of applied mathematics. Indeed, more time was 
spent on mathematical applications in the higher senior class than in any 
other—although the overall proportion was still minute. One subject considered 
was probability theory, which the students had studied—in its pure form—in the 
higher junior class. De Morgan would now introduce them to its applications, most 
notably its use in error theory, a precursor of what would now be called mathemat- 
ical statistics. De Morgan’s teaching of this subject was also heavily influenced by 
the work of Gauss a few decades before. This is hardly surprising since the main 
topics in this area, such as the weight of observations and the method of least 
squares, were all introduced by Gauss. Thus once again, De Morgan can be seen to 
be acquainting his students with (fairly) recent work on a new and rapidly growing 
area of mathematical research. 

Less recent, but certainly still applied, mathematics is contained in two 
notebooks on the subject of dynamics. Strictly speaking, this would have been 
taught by the professor of natural philosophy, but De Morgan’s treatment was 
entirely mathematical, dealing purely with theoretical problems involving the 
derivation of equations of motion for particles travelling under certain conditions. 
Moreover, throughout these tracts, he is at pains to stress the distinction between 
the abstract mathematical notions of velocity and acceleration on the one hand, 
and the physical phenomena (e.g., force, pressure, and attraction) that cause them. 
Thus, for example: 


When, as is usual in books on mechanics, acceleration is much confounded 
with force measured by the acceleration it produces...—called accelerating 
force—the centrifugal acceleration, a law of space, gets the name of centrifugal 
force, whether there be such a force in action or not. [52, f. 16] 


His motivation for thus trespassing on materials within the domain of mathemati- 
cal physics was his belief that “the want of sufficient attention to this distinction 
puts some difficulties in the way of beginners in dynamics”. [52, f. 1] In other 
words, he thought that if his students received an adequate notion of velocity and 
acceleration independently of any physical consideration of the properties of 
matter, they would be better equipped to understand the subject of dynamics when 
they came to study natural philosophy. 

Having been given a thorough grounding in most areas of contemporary 
mathematical science, even proceeding far enough in certain subjects to have 
become acquainted with several aspects of recent research, the student would have 
several options for further study. Although the concept of a graduate research 
student did not exist in Britain at this time, the higher senior class would almost 
certainly have served as a good starting point for those aspiring to an academic 
career in mathematics, since it provided guidance not only for those aiming for 
mathematical honours, but also for those trying for a London M.A. or preparing to 
embark on a course of study at Cambridge. 


3.5 Overview. In fact, viewing the surviving mathematical tracts as representative 
of De Morgan’s entire syllabus, one is impressed not just by the level to which 
mathematics was taught, but also by the range of topics to which the students were 
exposed by their professor. To be sure, there is nothing unusual in his basic course 
structure, whereby the subject is developed from arithmetic and Euclid through 
the standard branches of algebra, geometry, trigonometry, and calculus. Rather, it 
is the additional, less prominent topics, 
absent from the course outlines and found only in the tracts, which give the course 
its variety and make it particularly distinctive. The result is a mathematical course 
of considerable scope and breadth. 

De Morgan’s course was as advanced as it was varied. Indeed, it extended 
almost as far as an undergraduate course could at the time, since developments in 
many branches (especially analysis and algebra) were transforming the subject 
almost as it was being taught. This is reflected in the fact that the tracts were 
constantly being updated. A good example is found in notebooks relating to the 
theory of equations, one of the most rapidly-expanding areas at this time, which 
yielded new subjects in the form of complex analysis and group theory. Although 
group theory was never taught by De Morgan, many of the results that contributed to this latter 
development are contained in a tract written in 1855, featuring “selections from 
what has been recently done in the higher parts of the theory of equations”. 
[48, f. 1] This featured substantial extracts from the second edition of Serret’s 
Cours d’algèbre supérieure (1854), the third edition of which, published in 1866, was 
to mark the first appearance of Galois theory in a textbook. 

De Morgan’s mathematical tracts give us far more information about the style 
and structure of his teaching than any printed syllabus or exam paper of the time. 
But ironically, these manuscripts, invaluable though they undoubtedly are, still do 
not tell us exactly what went on in his lecture room. By virtue of the fact that they 
were written explicitly to supplement the students’ notes obtained from the Profes- 
sor’s lectures, the tracts can give us only a general impression of what the students 
would have been taught in person. Fortunately, however, there are three further 
sources of information that can bring us one step closer to understanding just what 
it was like to study mathematics under De Morgan. 


4. STUDENT-AUTHORED ACCOUNTS. Two of these sources are extracts from 
the private writings of two eminent students, the journalist and constitutional 
author Walter Bagehot and the mathematical economist and logician William 
Stanley Jevons. However, the third is perhaps the most valuable, for two reasons. 
Firstly, because it was written by a student of more average ability; and secondly, 
because it is the student’s original college notebook, in which he transcribed 
De Morgan’s lectures as they happened. This comparatively academically undistin- 
guished student was one John Golch Hepburn, also destined to achieve no 
particular eminence following his graduation. However, Hepburn’s notebook pro- 
vides us with a unique insight as to what the student would have experienced in 
De Morgan’s lecture room 150 years ago. 

The manuscript contains notes from 21 of De Morgan’s lower senior lectures on 
algebraic geometry and the differential calculus, delivered between 11 March and 
13 May 1847. The first begins with a study of the ellipse, considering aspects such 
as area and conjugate diameters. The next lecture, on 13 March, deals with 
Kepler’s Laws, orbit-time calculations, and an introduction to parabolae. Hyperbo- 
lae and asymptotes are treated five days later. By 27 March, the emphasis was on 
tangents and chords to conic sections. After an absence from two or three lectures, 
Hepburn’s notes resume on 16 April, when sections of cylinders, cones, and 
Figure 2. John Hepburn’s undergraduate notes on De Morgan’s Lectures from March 1847. 
spheres were under discussion. Less than a week later the students were being 
introduced to the differential calculus. 

De Morgan clearly approached the new subject at some considerable speed 
since, on its first day, he was teaching derivatives of fundamental expressions and 
the product and quotient rules, yet, by 24 April, two days later, he was differentiating 
$x^x$. By the end of the month, physical notions such as velocity had been 
introduced, with tangent/normal and max/min problems brought in on 1 May. 
Maclaurin’s theorem was proved for convergent series in the next lecture, followed 
by Taylor’s and Lagrange’s theorems, together with related problems. By 8 May, 
the class had been introduced to the calculus of finite differences, the notes 
concluding with an introduction to the calculus of operations. 

Reference to section 3.3 confirms that Hepburn’s notes correspond very closely 
to topics dealt with in De Morgan’s tracts for his lower senior class, but perhaps 
more remarkable is how rapidly the Professor propelled his students through the 
subject. His introduction to the calculus took him a little over two weeks, 
consisting of just seven lectures. In that time, he discussed first principles, 
including foundational concepts such as limits, as well as derivatives of functions, 
fundamental rules, and elementary applications, before moving on to some crucial 
results in analysis. It is little wonder that he provided tracts for his students to 
augment their lecture notes! 

De Morgan’s homework problems were numerous and far from trivial. A few 
examples from Hepburn’s lecture notes indicate the standard of De Morgan’s 
homework questions at the lower senior level: 


Determine area of parabola as extreme case of area of ellipse. Suppose axis 
major become > & > ; e being nearer & nearer = 1. [38, f. 25] 


Required the [Maclaurin] developments of «**, $(1 + x)^n$, $\sin x$, $\cos x$, $\tan x$, 
and $e^{\cos x}$ to 8th power at least. [38, f. 193] 


Try to give a geometrical proof of the ratio of two magnitudes whᶜʰ vanish is 
the same thing as the ratio of their diff. Coeffˢ. [38, f. 197] 


In addition to such problems, Hepburn’s notes are permeated with references to 
recommended reading. Perhaps the most intriguing citation, contained in the 
lecture on the foundations of the calculus, was “See Leipzig Acts 1684”.⁵ [38, f. 137] 
Thus it would certainly appear from this text that De Morgan’s course was not for 
the faint-hearted, yet perhaps the only detail absent in the document is any 
indication of how difficult the student actually found it. For this information we 
are obliged to refer to our two remaining sources, which fortunately shed some 
considerable light on this question. 

The sources are the diaries and correspondence of Walter Bagehot and Stanley 
Jevons, who attended De Morgan’s lectures during the 1840s and 1850s, respec- 
tively. From both it would appear that mathematics under De Morgan was 
stimulating but never easy. Thus we find Bagehot writing in 1843: “De Morgan has 
been taking us through a perfect labyrinth lately; he was quite lost by the whole 


⁵This refers to the 1684 edition of the journal Acta Eruditorum Lipsienium, which contained the first 
publication by Leibniz on the differential calculus. 
class for one lecture, but we are, I hope, getting better, and more gleg⁶ at the 
uptake. We have been discussing the properties of infinite series, which are very 
perplexing.” [1, p. 118] And, as he approached his final exams, Bagehot’s workload 
intensified: “I have been reading some of the Theory of Numbers, which 
De Morgan says is the best exercise for the head possible, and certainly is a hard 
stretch for my reading powers and memory.” [1, p. 159] 

Stanley Jevons experienced De Morgan’s teaching a decade later than Bagehot, 
and at two different periods. These are recorded in detail in the diary and 
correspondence written by him during his college years. It is thus in his memoirs 
that perhaps the fullest and most candid account of experiences as a student of 
De Morgan can be found. These examples give an evocative description of one 
man’s study of mathematics at University College under the tuition of Augustus 
De Morgan: 


In mathematics we are just beginning the theory of equations, and during the 
last week have got through Descartes’, Fourier’s, and Sturm’s theorems of the 
limits of the roots of equations. They are the most truly difficult things we 
have come to, and I do not thoroughly understand them yet. [23, p. 29] 


...one learns more and more to adore De Morgan as an unfathomable fund 
of mathematics. We were delighted the other day when, in the higher senior, 
he at last appeared conscious that a demonstration about differential equa- 
tions, which extended through the lecture, was difficult; he promised, indeed, 
to repeat it. But then one is disappointed to find that the hardest thing he 
gives in any of his classes is still to him a trifle, and that the bounds of 
mathematical knowledge are yet out of sight. [23, p. 150] 


5. CONCLUSION: A COMPARISON OF CONTEMPORARIES. It is not only 
through contemporary student accounts that we can determine the calibre of De 
Morgan’s teaching. In later years, many of his students recalled the effect he had 
on them. More than half a century after experiencing his teaching in the late 
1840s, the English historian Thomas Hodgkin wrote: 


Towering up intellectually above all his fellows, as I now look back upon him, 
rises the grand form of the mathematician, Augustus De Morgan, known, I 
suppose to each succeeding generation of his pupils as ‘Gussy’. A stout and 
tall figure, a stiff rather waddling walk, a high white cravat and stick-up 
collars in which the square chin is buried, a full but well chiselled face, very 
short-sighted eyes peering forth through gold-rimmed spectacles; but above 
all such a superb dome-like forehead, as could only belong to one of the 
kings of thought: that is my remembrance of De Morgan, and I feel in 
looking back upon his personality that his is one of the grandest figures that I 
have known. [2, p. 80] 


Hodgkin was not the only non-mathematician on whom De Morgan made a
substantial impact. Reminiscing in 1921, the lawyer James Bourne Benson affirmed
that “De Morgan [was] looked upon with awe” [3, f. 3] by the undergraduates
of his day. The distinguished chemist Sir Henry Enfield Roscoe went further
still, opining that De Morgan was more than “merely a mathematician and a
unique teacher; he was one of the profoundest and subtlest thinkers of the
nineteenth century”. [32, p. 25] “As a teacher of mathematics,” wrote Jevons,
“De Morgan was unrivalled. ... De Morgan’s writings however excellent, give little
idea of the perspicuity and elegance of his viva voce expositions, which never failed
to fix the attention of all who were worthy of hearing him.” [24, p. 8]

⁶A Scots word (found in the poetry of Robert Burns) meaning astute, quick, keen, or alert.

Jevons was later to acknowledge the profound effect of De Morgan on his 
intellectual development, and it is clear that the careers of many other former 
students were also influenced in some way by De Morgan’s teaching. Francis 
Guthrie, the originator of the Four-Colour Conjecture, became a professor of 
mathematics in South Africa; E. J. Routh moved to Cambridge, becoming one of 
the most successful mathematical coaches in its history; and Isaac Todhunter 
achieved renown for a highly successful series of textbooks as well as his research 
into various aspects of the history of mathematics. 

This influence over students reminds one of the effect of Karl Weierstrass on 
many of those who attended his lectures at the University of Berlin in the latter 
half of the nineteenth century. While Weierstrass had far more first-rate mathematics
students than De Morgan (among them Otto Hölder, Adolf Hurwitz, Felix
Klein, Hermann Minkowski, Gösta Mittag-Leffler, Hermann Schwarz, and E. H.
Moore), the image of a mathematician who “became a recognised master, primarily
through his lectures” [4, p. 221] is common to both.

One could not provide a greater contrast to the didactic methods of De Morgan 
and Weierstrass than those of their earlier contemporary Augustin-Louis Cauchy. 
As a professor at the École Polytechnique in Paris from 1816, Cauchy taught
mathematics to trainee engineers. Yet, while his nineteenth-century biographer
C. A. Valson extols his virtues as a teacher who “never left a subject until he had
completely exhausted and elucidated it”, [53, I, p. 64] the records of the École
contain a damning report of 1821 in which it is claimed that “numerous warnings
have been given, for 5 years, to Mr. Cauchy to undertake to simplify his methods
and to conform exactly to the programmes [of lectures].” [19, II, p. 711] Furthermore,
it is asserted that “there has sometimes been...a lack of clarity in his
lectures...” [19, II, p. 712]

Cauchy was also admonished by his superiors for “the lack of that aplomb that 
one must generally suppose of a professeur already celebrated in the sciences”. 
[19, II, p. 711] In particular, his punctuality was criticised when he arrived for a 
lecture “10 minutes after the gathering of the Students in the amphitheatre”.
[19, II, p. 711] This characteristic was certainly not shared with De Morgan, who 
consistently placed great emphasis on the precision of his arrival. Indeed, he was 
apparently “so punctual and so regular in the performance of his college duties 
that his passage to and from his classes served as a time-piece to observant 
students”. [28, p. 115] Moreover, during his time at University College, he became 
one of its most conscientious and respected professors, lecturing from 9 to 10am 
and 3 to 4pm every day except Sundays. Indeed, according to Roscoe, “the trouble 
he took with students was extraordinary”. [32, p. 25] 

De Morgan’s innate mathematical propensity was enriched by strong commu- 
nicative skills, which, together with a talent for presenting complex ideas in an 
intelligible form and a pithy lecturing style, resulted in the ability to captivate his 
audience irrespective of the topic he was treating. It would also appear from more 
than one source that, as with other areas of his instruction, in order to foster 
correct notions in his students, De Morgan’s keen sense of humour was often
employed as a pedagogic tool:


One thing which made his classes lively to men who were up to his mark, was 
the humorous horror he used to express at our blunders, especially when we 
took the conventional or book view instead of the logical view. The bland 
“hush!” with which he would suppress a suggestion which was simply stupid, 
and the almost grotesque surprise he would feign when a man betrayed that, 
instead of the classification by logical principles, he was thinking of the old 
unmeaning classification by rule in the common school-books, were exceed- 
ingly humorous, and gave a life to the classes beyond the mere scope of their 
intellectual interests. [15, pp. 97-8] 


Here again, De Morgan’s formal but good-humoured style contrasts with that of 
some of his contemporary counterparts. It has been said that “it was only gradually 
that Weierstrass acquired the masterly skill in lecturing extolled by his later 
students. Initially his lectures were seldom clear, orderly, or understandable.” 
[4, p. 221] Moreover, claimed Felix Klein (himself a highly successful lecturer), the 
“imposing” personality of the great man “gave his lectures a distinctly unconge- 
nial, authoritarian quality.” [25, p. 163] 

De Morgan was able to balance congeniality with firm discipline to maintain a 
serious but convivial atmosphere in his lecture room. It was not a skill all of his 
fellow lecturers were able to achieve. James Clerk Maxwell was arguably one of 
the foremost British mathematical physicists of the time when he lectured at King’s 
College London in the early 1860s. Yet, remarkable though his scientific creden- 
tials may have been, “as a teacher of raw youths,...he did not prove to be a 
success ...and it seems not unlikely that the students were too much for him.” 
[22, p. 247] Maxwell’s inability to maintain order in his classes was exacerbated by 
additional shortcomings as a lecturer. A recent biographer explains: “The evidence 
is that, as a teacher, he had unusual difficulties. His delivery was poor. He could 
control neither the speed of his thoughts nor the flights of his mind... . Very 
likely only the occasional, particularly brilliant student could follow his lectures.” 
[36, p. 100] 

The same could be said of the teaching of James Joseph Sylvester, himself a 
former student of De Morgan, who, while an outstanding pure mathematician, was 
by no means a clear lecturer. In fact, as De Morgan later recalled: “When he was
with us [as professor of natural philosophy at University College from 1837-41] he 
was an entire failure: whether in lecture room or in private exposition, he could 
not keep his team of ideas in hand.” [37] Yet, forty years later, the very qualities 
that had made him unsuccessful as an undergraduate lecturer proved inspiring to 
his graduate students at Johns Hopkins University. As one later recalled: “One 
could not help being inspired by such teaching, and many of us were led to 
investigate on lines which he touched upon.” [25, p. 81] Thus, a style of lecturing 
far less structured and meticulously arranged than that of De Morgan, was no less 
successful in holding the attention of the able students. 

So what can we conclude from these comparisons of teaching characteristics of 
De Morgan and his contemporaries? What are the distinguishing features of a 
great teacher? First of all, the lecturer must be capable of delivering tuition by 
means of a clear and systematically planned series of lectures to form a structured 
and intelligible course. Clearly, De Morgan was amply equipped to do this, while 
Sylvester, Cauchy, and even Weierstrass to begin with, were not. Secondly, he must 
be able to maintain order in the lecture room so as to keep the attention of the 
class focused on the subject material. Here again, De Morgan satisfies this
criterion, whereas others, most notably Maxwell, do not. 

But finally, one must be able to inspire one’s students with a love and fascina- 
tion for their subject. De Morgan did this by concentrating on inculcating a deep 
understanding of fundamental principles rather than a mere skill with techniques 
and manipulation. Of course, Weierstrass, Klein, and Sylvester achieved success, 
but this was largely attained by teaching graduates whose interest in the subject 
was already strong. De Morgan’s achievement lies in the fact that he was able to 
persuade so many undergraduates (even those who took the subject no further) of 
the beauty and allure of mathematics. 

Without question Cauchy, Weierstrass, Klein, Maxwell, and Sylvester were all 
great mathematicians. Some (Weierstrass and Klein in particular) were great 
teachers as well. But whether De Morgan could also be called a “great” mathe- 
matician is debatable. True, he played a major role in the development of symbolic 
logic and its introduction into mainstream mathematics; he also contributed to the 
growth of abstract algebra, and promoted the use of Cauchy’s limit-based approach 
to the calculus, but his mathematical output, while by no means trivial, hardly 
merits the term “great”. As a researcher he is clearly not in the same league as
those already listed. But, for his work in the lecture room, he certainly deserves a 
place in the first rank, possessing in abundance all the attributes of a memorable 
and effective educator. Indeed, if De Morgan’s mathematical reputation had to 
rest on any one achievement, it would have to be as a teacher to whom the term 
“great” could truly be applied. 
The TEAM Approach to Investing


Frank Gerth III 


1. DESCRIPTION OF STRATEGIES. We consider an investor with a portfolio of 
“stock-like” investments (e.g., an S&P 500 Index fund) and “cash-like” invest- 
ments (e.g., Treasury bills, CDs, and high-grade commercial paper). S(t) and C(t) 
denote the values at time $t$ of this “stock fund” and “cash fund,” respectively, and

$$V(t) = S(t) + C(t) \tag{1.1}$$

denotes the total portfolio value at time $t$. The ratios $r(t,h) = [V(t+h) - V(t)]/V(t)$
have been studied exhaustively for many such portfolios and for time
increments $h$ ranging from days, weeks, and months, to quarters, years, and even
longer. For short intervals $h$, the ratios $r(t,h)$ may be positive or negative, are
seldom large, and are difficult to distinguish from independent identically distributed
random variables with constant (or slowly varying) means and variances.

At an initial time $t_0$, we suppose amounts $S_0$ and $C_0$ are invested in the stock
fund and cash fund, respectively. We compare the results of two strategies. The 
first is a buy-and-hold strategy with no exchanges between the stock fund and cash 
fund. The second involves periodically reallocating money between the stock fund 
and cash fund in a way that maximizes a certain function that is described in the 
next section. We call the first the BH (Buy-and-Hold) strategy and the second 
the TEAM (Target Equity Allocation Management) strategy. Our main result 
(Proposition 1) is that the TEAM strategy produces a higher expected total 
portfolio value than does the BH strategy for the same level of risk. 

What are the reallocations in the TEAM strategy? Pick a convenient time 
interval $h$, and let $t_i = t_0 + ih$ for $i = 1, 2, \ldots, n$. Suppose the investor moves
money back and forth between the stock fund and cash fund only at the instants $t_i$
for $i = 1, 2, \ldots, n-1$, in an attempt to obtain a large total portfolio value
$V(t_n) = S(t_n) + C(t_n)$. For $i = 1, 2, \ldots, n$, let $s_i$ (respectively, $c_i$) denote the rate
of return of the stock fund (respectively, cash fund) during the $i$-th time period. In
the TEAM strategy, money is moved between the stock fund and cash fund at the
instants $t_i$ so that

$$S(t_i) = S_0 (1 + c_1)(1 + c_2) \cdots (1 + c_i) \tag{1.2}$$

for $i = 1, 2, \ldots, n-1$. The reason for these stock fund allocations is explained in
the next section. In contrast, in the BH strategy,

$$S(t_i) = S_0 (1 + s_1)(1 + s_2) \cdots (1 + s_i) \tag{1.3}$$

for each $i$.


2. ANALYSIS OF STRATEGIES. We suppose that the $s_i$’s (respectively, $c_i$’s) are
independent identically distributed random variables with means $\mu_s$ and variances
$\sigma_s^2$ (respectively, means $\mu_c$ and variances $\sigma_c^2$). We assume $\mu_s > \mu_c$ and $\sigma_s^2 > \sigma_c^2$.
The first inequality reflects the fact that stock-like assets tend to appreciate more
rapidly than cash-like assets, while the second inequality expresses the fact that
stock-like assets appreciate at a riskier (less predictable) rate than cash-like assets.
Generally speaking, investors are paid to bear risk, with extra pay for extra risk. 

Let $\Delta_i$ denote the dollar amount transferred from the stock fund to the cash
fund at time $t_i$. If $\Delta_i < 0$, money is transferred from the cash fund to the stock
fund. (In the BH strategy $\Delta_1 = \cdots = \Delta_{n-1} = 0$.) Write $S_i = S(t_0 + ih)$,
$C_i = C(t_0 + ih)$, and $V_i = V(t_0 + ih)$. For any sequence $\Delta_1, \ldots, \Delta_{n-1}$,

$$S_i = S_{i-1}(1 + s_i) - \Delta_i \tag{2.1}$$

and

$$C_i = C_{i-1}(1 + c_i) + \Delta_i \tag{2.2}$$

for $i = 1, \ldots, n$. We may suppose $\Delta_n = 0$ since $\Delta_n$ is of no consequence. Then

$$V_i = S_i + C_i = V_{i-1}(1 + c_i) + S_{i-1}(s_i - c_i) \tag{2.3}$$

for each $i$. Note that $V(t) = S(t) + C(t)$ is continuous across the boundaries
(instants) separating consecutive time intervals, even though $S(t)$ and $C(t)$ are not.
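
The bookkeeping in (2.1) through (2.3) can be sketched in a few lines of Python. This is only an illustration: the function name `simulate`, the three-period returns, and the 50/50 starting allocation are assumptions made for the example, not data from the article. The sketch applies the transfer rule for each strategy and checks identity (2.3) period by period.

```python
from math import prod, isclose

def simulate(S0, C0, s, c, strategy):
    """Return lists (S, C, V) of fund values at t_0, ..., t_n.

    s[i-1] and c[i-1] are the stock and cash returns in period i.
    strategy is "BH" (no transfers) or "TEAM" (stock fund reset to
    S0 * (1+c_1)...(1+c_i) at each instant t_i with i < n).
    """
    n = len(s)
    S, C = [S0], [C0]
    for i in range(1, n + 1):
        grown_stock = S[-1] * (1 + s[i - 1])
        grown_cash = C[-1] * (1 + c[i - 1])
        if strategy == "TEAM" and i < n:
            target = S0 * prod(1 + cj for cj in c[:i])   # equation (1.2)
            delta = grown_stock - target                 # moved from stock to cash
        else:
            delta = 0.0                                  # BH, or the final instant t_n
        S.append(grown_stock - delta)                    # equation (2.1)
        C.append(grown_cash + delta)                     # equation (2.2)
    V = [a + b for a, b in zip(S, C)]
    return S, C, V

# Hypothetical returns for n = 3 periods (illustrative numbers only).
s = [0.20, -0.10, 0.15]
c = [0.04, 0.04, 0.04]
for name in ("BH", "TEAM"):
    S, C, V = simulate(50.0, 50.0, s, c, name)
    # Check identity (2.3) period by period.
    for i in range(1, len(V)):
        rhs = V[i - 1] * (1 + c[i - 1]) + S[i - 1] * (s[i - 1] - c[i - 1])
        assert isclose(V[i], rhs)
    print(name, [round(v, 4) for v in V])
```

Because the transfer only moves money between the two funds, the total $V_i$ is the same immediately before and after each reallocation, which is why (2.3) holds for both strategies.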

The investor may cause the quantities $S_0, S_1, \ldots, S_{n-1}$ to assume any values in
the intervals $[0, V_0], [0, V_1], \ldots, [0, V_{n-1}]$ by choosing $S_0$ and $\Delta_1, \ldots, \Delta_{n-1}$
appropriately. Even these constraints may be eliminated by “selling short” stock-like
assets or cash-like assets, e.g., borrowing money in the latter case. Therefore, it
seems natural to ignore the constraints $0 \le S_i \le V_i$ during the initial search for an
effective investment strategy.

Now let $g_0 = 1$ and

$$g_i = g_{i-1}(1 + c_i) = (1 + c_1) \cdots (1 + c_i)$$

for $i \ge 1$. Making the substitutions $V_i = g_i X_i$ and $S_i = g_i u_{i+1}$ in (2.3) gives

$$X_i = X_{i-1} + R_i u_i \tag{2.4}$$

where

$$R_i = (s_i - c_i)/(1 + c_i) \tag{2.5}$$

for each $i$. Then $X_0 = V_0$ and

$$X_i = X_0 + R_1 u_1 + \cdots + R_i u_i \tag{2.6}$$

for each $i \ge 1$. The investor’s choice of the numbers $u_1, \ldots, u_n$ is entirely unrestricted,
and the investor presumably chooses $u_1, \ldots, u_n$ in an attempt to maximize
an appropriate function of portfolio performance.

For subsequent calculations it is useful to assume that $\sigma_c^2 = \mathrm{Var}(c_i) = 0$ for
each $i$, so that $c_1 = \cdots = c_n = \mu_c = c$, a numerical constant, and then
$g_i = (1 + c)^i$ for each $i$. In practice, $\mathrm{Var}(c_i)$ is usually so much smaller than $\mathrm{Var}(s_i)$ that
it seems natural to ignore $\mathrm{Var}(c_i)$. Indeed, the availability of a “riskless rate $c$ of
return” is an essential feature of the “Black-Scholes environment” in which most
financial analysis is performed.

Next let $M_n = V_0 g_n$, which is the amount the cash fund would contain if the
entire initial investment $V_0$ had been placed in the cash fund and left alone. We
would like the terminal value $V_n$ of our chosen portfolio of stocks and cash to
exceed $M_n$ by a large amount. However, to take into account the riskiness of our
portfolio, we choose to maximize the function

$$f(V_n - M_n) = E(V_n - M_n)/\sqrt{\mathrm{Var}(V_n - M_n)}, \tag{2.7}$$

which increases with the expected value of $V_n - M_n$ and decreases with the
standard deviation of $V_n - M_n$. This function is essentially the Sharpe reward-to-variability
ratio [2]. Now from (2.6)

$$V_n - M_n = g_n (X_n - X_0) = g_n (R_1 u_1 + \cdots + R_n u_n) = (1 + c)^n (R_1 u_1 + \cdots + R_n u_n) \tag{2.8}$$

under our assumption that $c_i = c$ for each $i$. Then (2.5) and our assumption that
the $s_i$’s are independent identically distributed random variables imply that the
$R_i$’s are independent identically distributed random variables with means
$(\mu_s - c)/(1 + c)$ and variances $\sigma_s^2/(1 + c)^2$.

We now consider a special case for the $u_i$’s; namely, we assume that the $u_i$’s are
chosen in a way that does not depend on the stock fund rates of return $s_1, s_2, \ldots, s_n$.
Then (2.7) and (2.8) imply

$$f(V_n - M_n) = K \langle e, u \rangle / \|u\|_2 \le K \|e\|_2 = K\sqrt{n} \tag{2.9}$$

where $u$ is the vector $(u_1, \ldots, u_n)$, $e$ is the vector with $n$ components each equal to
1, and $K = (\mu_s - c)/\sigma_s$. The inequality in (2.9) is a consequence of the Cauchy-Schwarz
inequality, with equality if and only if $u = \lambda e$ for some positive scalar $\lambda$.
Since $u_1 = g_0 u_1 = S_0$, then equality in (2.9) occurs precisely when $u = S_0 e$ with $S_0$
positive. Then

$$S_i = g_i u_{i+1} = S_0 (1 + c)^i \tag{2.10}$$

for $i = 1, \ldots, n-1$. Note that (2.10) is the same as (1.2) under the assumption
that $c_i = c$ for each $i$. This optimal strategy corresponds to the choices
$\Delta_i = (s_i - c) S_{i-1}$ for $i = 1, \ldots, n-1$ in (2.1). Each $S_i/S_{i-1} = 1 + c$, which means that
the stock fund allocations increase at the riskless rate $c$. This strategy is called the
Target Equity Allocation Management (TEAM) strategy since it allocates resources
between the riskier stock fund and more conservative cash fund in a way
calculated to achieve the modest equity targets $S_i = S_0 (1 + c)^i$ in the stock fund,
while transferring anticipated surpluses to the cash fund.
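
The bound in (2.9) is easy to check numerically. The sketch below is illustrative only: the helper name `static_ratio` and the parameter values ($\mu_s = 0.10$, $\sigma_s = 0.20$, $c = 0.04$, $n = 5$) are assumptions chosen for the example. It evaluates $K \langle e, u \rangle / \|u\|_2$ for a constant allocation vector and for a non-constant one.

```python
from math import sqrt

def static_ratio(u, mu_s, sigma_s, c):
    """f(V_n - M_n) = K <e,u> / ||u||_2 for allocations u fixed in advance."""
    K = (mu_s - c) / sigma_s
    return K * sum(u) / sqrt(sum(x * x for x in u))

mu_s, sigma_s, c, n = 0.10, 0.20, 0.04, 5    # hypothetical parameters
bound = (mu_s - c) / sigma_s * sqrt(n)        # K * sqrt(n), right side of (2.9)

constant = static_ratio([1.0] * n, mu_s, sigma_s, c)              # u = S_0 e
tilted = static_ratio([1.0, 1.2, 1.4, 1.6, 1.8], mu_s, sigma_s, c)

print("bound   :", bound)
print("constant:", constant)   # attains the bound
print("tilted  :", tilted)     # strictly below the bound
```

Any positive multiple of $e$ attains the bound, which is why the value of $S_0$ does not affect the ratio.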

Although the TEAM strategy maximizes the function (2.7) among all strategies
for which the $u_i$’s do not depend on the stock fund rates of return $s_1, \ldots, s_n$, we
should expect some feedback type strategies (in which the $u_i$’s depend on the $s_i$’s)
to produce greater values for $f(V_n - M_n)$. In the buy-and-hold (BH) strategy, the
$u_i$’s do depend on the $s_i$’s since $\Delta_1 = \cdots = \Delta_{n-1} = 0$ imply that

$$(1 + c)^i u_{i+1} = S_i = S_0 (1 + s_1) \cdots (1 + s_i) \tag{2.11}$$

for $i = 1, \ldots, n-1$. Hence there is still some work to do to show that the TEAM
strategy imparts a higher value to the function $f(V_n - M_n)$ than does the BH
strategy.

To avoid confusion with other strategies, let $B_n$ rather than $V_n$ denote the
terminal portfolio value for the BH strategy. From (2.5) and (2.11), $R_i$ depends on
$s_i$, whereas $u_i$ depends on $s_1, \ldots, s_{i-1}$. Thus, $R_i$ and $u_i$ are independent, and (2.8)
implies

$$E(B_n - M_n) = (1 + c)^n (\bar{R}_1 \bar{u}_1 + \cdots + \bar{R}_n \bar{u}_n) \tag{2.12}$$

where

$$\bar{R}_i = E(R_i) = (\mu_s - c)/(1 + c) \tag{2.13}$$

$$\bar{u}_i = E(u_i) = S_0 (1 + \mu_s)^{i-1}/(1 + c)^{i-1} \tag{2.14}$$

for $i = 1, \ldots, n$. A straightforward but lengthy calculation shows that

$$\mathrm{Var}(B_n - M_n) > (1 + c)^{2n-2} \sigma_s^2 (\bar{u}_1^2 + \cdots + \bar{u}_n^2). \tag{2.15}$$

Then (2.7), (2.12), (2.13), and (2.15) imply

$$f(B_n - M_n) < K \langle e, w \rangle / \|w\|_2 < K \|e\|_2 = K\sqrt{n} \tag{2.16}$$

where $w = (\bar{u}_1, \ldots, \bar{u}_n)$, $e = (1, \ldots, 1)$, and $K = (\mu_s - c)/\sigma_s$. The second inequality
in (2.16) is a consequence of the Cauchy-Schwarz inequality and the fact that
$w \ne \lambda e$ for a scalar $\lambda$ since $\mu_s > c$ in (2.14).

Now if $T_n$ is the terminal portfolio value for the TEAM strategy, then

$$f(B_n - M_n) < K\sqrt{n} = f(T_n - M_n). \tag{2.17}$$

For the TEAM strategy, the analogs of (2.12) and (2.15) are

$$E(T_n - M_n) = (1 + c)^n (\bar{R}_1 S_0 + \cdots + \bar{R}_n S_0) = (1 + c)^{n-1} n (\mu_s - c) S_0 \tag{2.18}$$

$$\mathrm{Var}(T_n - M_n) = (1 + c)^{2n-2} n \sigma_s^2 S_0^2. \tag{2.19}$$

Since $\mu_s > c$, then (2.12), (2.14), (2.15), (2.18), and (2.19) imply that for the same
initial allocations in the BH strategy and in the TEAM strategy, the BH strategy
has a higher expected terminal portfolio value and a higher variance than does the
TEAM strategy. Next, observe that the value $f(T_n - M_n) = K\sqrt{n}$ in (2.17) does not
depend on the initial allocations in the stock fund and cash fund. Hence we could
increase the initial allocation to an amount $S_0'$ in the stock fund in the TEAM
strategy (while correspondingly decreasing the initial allocation to an amount $C_0'$ in
the cash fund) until $\mathrm{Var}(T_n' - M_n) = \mathrm{Var}(B_n - M_n)$, where $T_n'$ is the TEAM
terminal portfolio value for initial investments of $S_0'$ and $C_0'$ in the stock and cash
funds, respectively. Then (2.7) and (2.17) imply $E(T_n') > E(B_n)$.
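
The variance-matching argument can be illustrated numerically. The sketch below is not from the article: the parameter values are assumptions. It also replaces the bound (2.15) with the exact BH moments, which follow from $B_n - M_n = S_0[(1+s_1)\cdots(1+s_n) - (1+c)^n]$ and the standard identity $\mathrm{Var}(\prod X_i) = \prod E(X_i^2) - \prod (E X_i)^2$ for independent $X_i$. The TEAM moments are taken from (2.18) and (2.19).

```python
from math import sqrt

mu_s, sigma_s, c, n = 0.10, 0.20, 0.04, 5    # hypothetical parameters
S0 = 1.0                                      # BH initial stock allocation

# BH: B_n - M_n = S0 * (prod(1 + s_i) - (1 + c)^n), with the s_i i.i.d.
mean_bh = S0 * ((1 + mu_s) ** n - (1 + c) ** n)
var_bh = S0 ** 2 * ((sigma_s ** 2 + (1 + mu_s) ** 2) ** n - (1 + mu_s) ** (2 * n))
sd_bh = sqrt(var_bh)

# TEAM per unit of initial stock allocation, from (2.18) and (2.19).
mean_team_unit = (1 + c) ** (n - 1) * n * (mu_s - c)
sd_team_unit = (1 + c) ** (n - 1) * sqrt(n) * sigma_s

# Choose S0' so that the TEAM standard deviation matches the BH one.
S0_prime = sd_bh / sd_team_unit
mean_team = S0_prime * mean_team_unit

K = (mu_s - c) / sigma_s
print("f(BH)     =", mean_bh / sd_bh)
print("f(TEAM)   =", mean_team_unit / sd_team_unit, "vs K*sqrt(n) =", K * sqrt(n))
print("S0'/S0    =", S0_prime / S0)
print("E(T'-M_n) =", mean_team, " E(B_n-M_n) =", mean_bh)
```

Since both strategies start from the same total $S_0 + C_0$, they share the same $M_n$, so comparing $E(T_n' - M_n)$ with $E(B_n - M_n)$ is the same as comparing $E(T_n')$ with $E(B_n)$.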

We now list the primary assumptions that are used in our derivation and then 
state the proposition we have proved. 


The cash fund rate of return is constant over all time periods. (2.20) 


The stock fund rates of return in each time period are independent
identically distributed random variables. (2.21)

The expected stock fund rate of return exceeds the cash fund rate
of return. (2.22)

There are no taxes or transaction costs. (2.23) 


Proposition 1. Suppose an amount $S_0$ (respectively, $S_0'$) is invested initially in the
stock fund in the BH strategy (respectively, TEAM strategy) and an amount $C_0$
(respectively, $C_0'$) is invested initially in the cash fund in the BH strategy (respectively,
TEAM strategy). Let $B_n$ (respectively, $T_n'$) denote the terminal portfolio value for the
BH (respectively, TEAM) strategy for these initial allocations. Suppose $S_0'$ and $C_0'$ are
chosen so that $S_0' + C_0' = S_0 + C_0$ and so that the standard deviations of $T_n'$ and
$B_n$ are equal. Then under the assumptions (2.20) through (2.23), the expected value of
$T_n'$ is greater than that of $B_n$. Hence for the same level of risk (as measured by standard
deviation of terminal portfolio value), the TEAM strategy produces a higher expected
terminal portfolio value than does the BH strategy.


If $S_0$ is large relative to $C_0$, then it could happen that $S_0' > S_0 + C_0$, in which
case $C_0' < 0$. This corresponds to borrowing money in the TEAM strategy. Also,
reallocations in the TEAM strategy could require borrowing money.

The assumptions (2.20) through (2.23) are not precisely satisfied in practice, and
our model is a very simplified financial model. Nevertheless it is interesting to 
compare simulation results using the BH and TEAM strategies. 


3. SIMULATION RESULTS. We examine the results of some simulations using 
historical data from 1926 to 1995. Most of the data comes from Ibbotson and 
Sinquefield [1, pp. 54-55]. For rates of return for our stock fund, we use rates of 
return for Common Stocks (which correspond to the S&P 500 Index); for our cash
fund, we use the rates of return for U.S. Treasury Bills. We exclude transaction 
costs and taxes, and consider 14 non-overlapping 5-year periods from 1926 to 1995. 
In each 5-year period, the TEAM strategy reallocations occur annually, so n = 5. 
There are no reallocations in the BH strategy during each 5-year period. The data 
in [1] show that the serial correlation for annual rates of return is near zero for a 
stock fund such as an S&P 500 Index fund, which is consistent with what we 
would expect from (2.21). 

Since the TEAM strategy involves lower risk than the BH strategy when the BH 
and TEAM strategies have the same initial allocations, one can increase the initial 
percentage allocation to the stock fund when using the TEAM strategy. Let $S_0$
(respectively, $S_0'$) denote the initial percentage allocation in the stock fund when
using the BH strategy (respectively, TEAM strategy), and set

$$a = S_0'/S_0. \tag{3.1}$$
The TEAM simulation results in Table 1 correspond to a = 1.4. We also per- 
formed simulations with a = 1.0, 1.1, 1.2, 1.3, 1.5, 1.6, 1.7, and 1.8. As one would 
expect, the average terminal value for the TEAM strategy (and its standard 
deviation) increased as a increased. However the standard deviation for the 
TEAM strategy was less than the standard deviation for the BH strategy for 
a < 1.6. For 1.2 <a < 1.6, the TEAM strategy produced a higher terminal value 
than the BH strategy in at least 12 of the 14 five-year periods. 


TABLE 1. Total Returns for Two Strategies


Strategy              Buy and Hold (BH)    TEAM
Initial Value         $1.0000              $1.0000
Stock fraction         .5000                .7000
Cash fraction          .5000                .3000

                      Terminal Values ($)                Difference
Time Periods          BH                   TEAM          (TEAM − BH)
1926-1930             1.3476               1.5202          .1726
1931-1935             1.0964               1.3220          .2256
1936-1940             1.0150               1.1413          .1263
1941-1945             1.6010               1.6461          .0451
1946-1950             1.3221               1.3977          .0756
1951-1955             1.9973               1.9555         −.0418
1956-1960             1.3336               1.4382          .1046
1961-1965             1.5135               1.5951          .0816
1966-1970             1.2413               1.2481          .0068
1971-1975             1.2477               1.3398          .0921
1976-1980             1.6874               1.7785          .0911
1981-1985             1.8094               1.9154          .1060
1986-1990             1.6226               1.7063          .0837
1991-1995             1.6933               1.7713          .0780
Average               1.4663               1.5554          .0891
Standard Deviation     .2823                .2520          .0643
For a = 1.4 in Table 1, the average difference of .0891, i.e., an average 5-year
total return difference of 8.91% for the TEAM strategy over the BH strategy, is
significantly greater than zero at the 1% level for the t-statistic for the variable
(TEAM — BH) terminal value. One might argue that this variable is not normally 
distributed, and hence the t-statistic might not be appropriate. Further statistical 
justification for concluding that the (TEAM — BH) terminal value is significantly 
greater than zero is the fact that the TEAM terminal value exceeded the BH 
terminal value in 13 of the 14 time periods. Hence with an appropriate choice of a, 
the TEAM strategy can produce a higher terminal value than the BH strategy, with 
no greater risk than the BH strategy. 
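
The summary statistics quoted above can be recomputed from Table 1. The sketch below is a hedged illustration, not the author's calculation: the one-sample t statistic with the sample (n − 1) standard deviation and the one-sided sign test are assumed choices, and the 1991-1995 TEAM value is entered as 1.7713, the value implied by the BH entry plus the listed difference.

```python
from math import comb, sqrt
from statistics import mean, stdev

# Terminal values per $1 invested, transcribed from Table 1.
bh = [1.3476, 1.0964, 1.0150, 1.6010, 1.3221, 1.9973, 1.3336,
      1.5135, 1.2413, 1.2477, 1.6874, 1.8094, 1.6226, 1.6933]
team = [1.5202, 1.3220, 1.1413, 1.6461, 1.3977, 1.9555, 1.4382,
        1.5951, 1.2481, 1.3398, 1.7785, 1.9154, 1.7063, 1.7713]

diff = [t - b for t, b in zip(team, bh)]
n = len(diff)
d_bar = mean(diff)
d_sd = stdev(diff)                   # sample standard deviation (divides by n - 1)
t_stat = d_bar / (d_sd / sqrt(n))    # one-sample t statistic with n - 1 = 13 d.o.f.

# One-sided sign test: chance of at least `wins` positive differences
# if each period were a fair coin flip.
wins = sum(d > 0 for d in diff)
p_sign = sum(comb(n, k) for k in range(wins, n + 1)) / 2 ** n

print("mean difference:", round(d_bar, 4))
print("std deviation  :", round(d_sd, 4))
print("t statistic    :", round(t_stat, 2))
print("wins, sign-test p:", wins, p_sign)
```

The sign test makes no normality assumption, which is the point of the remark about 13 of the 14 periods.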

The data in Table 1 correspond to the assumption that each stock fund
investment $S_i$ in the TEAM strategy is limited to the total value $T_i$. If borrowing is
allowed, the TEAM strategy results are slightly better.
From the MONTHLY 25 Years Ago...


THE DERIVATIVE SONG
Words by Tom Lehrer. Tune: “There'll be Some Changes Made”


You take a function of x and you call it y,
Take any x-nought that you care to try,
You make a little change and call it delta x,
The corresponding change in y is what you find nex’,
And then you take the quotient and now carefully
Send delta x to zero, and I think you’ll see
That what the limit gives us, if our work all checks,
Is what we call dy/dx,
It’s just dy/dx


MONTHLY 81 (1974) 490
NOTES


Edited by Jimmie D. Lawson and William Adkins 


A Physically Motivated Further Note on 
the Mean Value Theorem for Integrals 


William J. Schwind, Jun Ji, and Daniel E. Koditschek 


The purpose of this Note is to extend the following result by Zhang concerning the 
mean value theorem for integrals [4], which in turn was an extension of Jacobson’s 
result [1]. 


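Theorem 1. Suppose the function $f$ is continuous on the interval $[a, b]$, and is $k$ times
differentiable at $a$ with $f^{(i)}(a) = 0$ $(i = 1, 2, \ldots, k-1)$, $f^{(k)}(a) \ne 0$. If $\xi_x$ is such that

$$\int_a^x f(t)\,dt = f(\xi_x)(x - a), \tag{1}$$

then

$$\lim_{x \to a} \frac{\xi_x - a}{x - a} = \left(\frac{1}{k+1}\right)^{1/k}. \tag{2}$$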


As a side note, Jacobson [1] states that his result may fall into the category of 
“interesting facts we once knew, but have now forgotten.” This indeed seems to be 
the case, as the right-hand side of (2) is derived in [2, p. 78]. 

We extend Theorem 1 to a considerably larger function class, which includes 
functions such as $f(t) = \sqrt{t - a}$ and $f(t) = 1/\sqrt{t - a}$ that do not satisfy the
hypotheses of Theorem 1. Additionally, we apply the new results to obtain 
approximations of integrals appearing in familiar engineering settings. In fact our 
consideration of this problem was motivated by the desire to approximate, in 
closed form, integrals arising from a class of central force problems and to do so 
independently of the particular mathematical form of the force law itself. 


Theorem 2. Suppose $f$ is continuous on $(a, b]$ and $g$ is integrable on $(a, b)$ with
$g(t) > 0$ for all $t \in (a, b)$. Let $x \in (a, b]$. If both $\lim_{t \to a^+} (f(t) - K)/(t - a)^r$ and
$\lim_{t \to a^+} g(t)/(t - a)^s$ exist and are nonzero for some constant $K$, some nonzero $r$, and
some $s > -1$ with $r + s > -1$, then


1. there exists a $\xi_x \in (a, x]$ such that

$$\int_a^x f(t) g(t)\,dt = f(\xi_x) \int_a^x g(t)\,dt; \tag{3}$$

2. for any such choice of $\xi_x$,

$$\lim_{x \to a} \frac{\xi_x - a}{x - a} = \left(\frac{s+1}{r+s+1}\right)^{1/r}. \tag{4}$$


Proof: Define $C_1 = \lim_{t \to a^+} (f(t) - K)/(t - a)^r$, $C_2 = \lim_{t \to a^+} g(t)/(t - a)^s$,
$\epsilon_1(t) = (f(t) - K)/(t - a)^r - C_1$, and $\epsilon_2(t) = g(t)/(t - a)^s - C_2$, so that

$$f(t) = K + C_1 (t - a)^r + \epsilon_1(t)(t - a)^r \tag{5}$$

and

$$g(t) = C_2 (t - a)^s + \epsilon_2(t)(t - a)^s, \tag{6}$$

where $\epsilon_i(t) \to 0$ as $t \to a$ for $i = 1, 2$.
First we must justify that the integral on the left hand side of (3) actually exists.
Fix $\delta \in (a, b)$ such that $|\epsilon_i(t)| < 1$ ($i = 1, 2$) for $a < t < \delta$. Now
$|f(t) g(t)| \le K_1 (t - a)^s + K_2 (t - a)^{r+s}$ for some constants $K_1$ and $K_2$. The Dominated Convergence Theorem
[3, p. 291] ensures that $\int_a^\delta f(t) g(t)\,dt$ exists. Therefore, $\int_a^x f(t) g(t)\,dt$ exists.


Proof of (1). Case 1 ($r > 0$). It is easily seen from (5) that $\lim_{t \to a^+} f(t) = K$. Define
$F(a) = K$ and $F(t) = f(t)$ for $t \in (a, b]$. Clearly, $F(t)$ is continuous on $[a, b]$. By
applying the Integral Mean Value Theorem [3, p. 281] to $F(t)$, there exists a $\xi_x$ in
$(a, x)$ such that

$$\int_a^x f(t) g(t)\,dt = \int_a^x F(t) g(t)\,dt = F(\xi_x) \int_a^x g(t)\,dt = f(\xi_x) \int_a^x g(t)\,dt.$$

Case 2 ($r < 0$). Without loss of generality take $C_1 > 0$ and $\int_a^x g(t)\,dt > 0$. Then
(5) requires that $\lim_{t \to a^+} f(t) = +\infty$. Thus there exists a positive $\delta$ such that

$$f(t) > \frac{\int_a^x f(t) g(t)\,dt}{\int_a^x g(t)\,dt} = \eta \quad \text{for all } a < t < a + \delta.$$

Therefore, a $\xi_x$ satisfying (3), if it exists, is not contained in $(a, a + \delta)$. For
$a_1 \in (a, a + \delta)$, $f$ is continuous on $[a_1, x]$ and $f(t) > \eta$ on $(a, a_1]$. Hence,

$$\min_{t \in (a, x]} f(t) = \min_{t \in [a_1, x]} f(t).$$

Therefore,

$$\max_{t \in [a_1, x]} f(t) \ge f(a_1) > \eta \ge \min_{t \in (a, x]} f(t) = \min_{t \in [a_1, x]} f(t).$$

By applying the Intermediate Value Theorem to $f(t)$ on $[a_1, x]$, we obtain
$\xi_x \in [a_1, x] \subset (a, x]$ such that $f(\xi_x) = \eta$.
Proof of (2). Substituting (5) and (6) into the left hand side of (3), one sees easily 
that 
C,K(x-—a)*™’ C,C,(x - ay"? 
s+1 ras tf 


+ Cf e(t(t —a)"’ dt + K fe,(t)(t —a) dt 


[FO 8(2) dt = 


+ Cfelyce — a)" dt + fredte(ty( ay ae 1) 
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On the other hand, substituting (5) and (6) into the right hand side of (3), we find 


x — C,K(x-a)'™ — C\C,( &, - a)"(x - a) 
f(&) J a(at = i st. 2 a i. - 


Cre,( €,)(& -— a) (x —a)y™ x , 
Rea 5 21 + KJ e(t)(t—a) dt 


+ C(& - a)" fe(t)(t- a)’ dt 


+ e( E)(E 4)" PPe(\(r- a) a, (8) 
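Equating (7) and (8) and simplifying gives

\[ \left( \frac{\xi_x - a}{x - a} \right)^{r} U = V, \tag{9} \]

where

\[
U = 1 + \frac{\epsilon_1(\xi_x)}{C_1}
+ \frac{(s + 1)\,\epsilon_1(\xi_x) \int_a^x \epsilon_2(t)(t - a)^{s}\,dt}{C_1 C_2 (x - a)^{s+1}}
+ \frac{(s + 1) \int_a^x \epsilon_2(t)(t - a)^{s}\,dt}{C_2 (x - a)^{s+1}}
\]

and

\[
V = \frac{s + 1}{r + s + 1}
+ \frac{(s + 1) \int_a^x \epsilon_2(t)(t - a)^{r+s}\,dt}{C_2 (x - a)^{r+s+1}}
+ \frac{(s + 1) \int_a^x \epsilon_1(t)(t - a)^{r+s}\,dt}{C_1 (x - a)^{r+s+1}}
+ \frac{(s + 1) \int_a^x \epsilon_1(t)\epsilon_2(t)(t - a)^{r+s}\,dt}{C_1 C_2 (x - a)^{r+s+1}}.
\]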
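Because $|\xi_x - a| < |x - a|$ and

\[ \lim_{x \to a} \frac{\int_a^x \delta(t)(t - a)^{m}\,dt}{(x - a)^{m+1}} = 0 \quad \text{if } \lim_{t \to a} \delta(t) = 0,\ m > -1, \]

we obtain

\[ \lim_{x \to a} U = 1 \quad \text{and} \quad \lim_{x \to a} V = \frac{s + 1}{r + s + 1} > 0. \tag{10} \]

Now (4) follows from (9) and (10). ∎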


Often, a good choice for $K$ in Theorem 2 is to take $K = \lim_{t \to a^+} f(t)$ if the limit
exists or $K = 0$ otherwise.

A few immediate observations can be made when $g(t) \equiv 1$. In this case (3) is
identical to (1), the statement of the Mean Value Theorem used by Jacobson and
Zhang.

Observation 1. If $g(t) \equiv 1$ in Theorem 2, then $s = 0$ and

\[ \lim_{x \to a} \frac{\xi_x - a}{x - a} = \left( \frac{1}{r + 1} \right)^{1/r} \tag{11} \]

for some nonzero $r > -1$.


Notice that the form of the limiting value in (11) is identical to that in (2), but
$r \in (-1, \infty) \setminus \{0\}$, while $k$ is only a natural number.


Observation 2. Theorem 1 is a special case of Theorem 2. 


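To see this, assume the hypothesis in Theorem 1; then

\[
\lim_{t \to a} \frac{f(t) - f(a)}{(t - a)^{k}}
= \lim_{t \to a} \frac{f^{(k-1)}(t)}{k!\,(t - a)}
= \lim_{t \to a} \frac{f^{(k-1)}(t) - f^{(k-1)}(a)}{k!\,(t - a)}
= \frac{f^{(k)}(a)}{k!},
\]

where the first equality is obtained by using L'Hospital's rule $k - 1$ times while
the third one follows from the definition of $f^{(k)}(a)$.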


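Observation 3. If, motivated by (4), we approximate $\xi_x$ by

\[ \hat{\xi}_x = a + \left( \frac{s + 1}{r + s + 1} \right)^{1/r} (x - a) \quad \text{for } x \text{ near } a, \]

and replace $\xi_x$ by $\hat{\xi}_x$ in (3), we obtain an approximation scheme to the integral

\[ \int_a^x f(t)g(t)\,dt \approx f(\hat{\xi}_x) \int_a^x g(t)\,dt \quad \text{for } x \text{ near } a. \tag{12} \]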
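
The scheme (12) can be tried numerically. The following Python sketch is only an illustration under stated assumptions: the helper names `mean_value_point` and `approx_integral` are hypothetical, the integral of $g$ is supplied in closed form by the caller, and the two test integrands are chosen here for convenience and do not come from the Note.

```python
import math

def mean_value_point(a, x, r, s):
    """Approximate mean value point a + ((s+1)/(r+s+1))**(1/r) * (x - a), as in (4)."""
    return a + ((s + 1.0) / (r + s + 1.0)) ** (1.0 / r) * (x - a)

def approx_integral(f, g_integral, a, x, r, s):
    """Scheme (12): f(xi_hat) times the integral of g over [a, x].

    g_integral(a, x) must return the integral of g exactly (an assumption of
    this sketch; the scheme itself does not say how that integral is obtained).
    """
    xi_hat = mean_value_point(a, x, r, s)
    return f(xi_hat) * g_integral(a, x)

# Example A (assumed): f(t) = t**(-1/2), g = 1 on (0, x]; K = 0, r = -1/2, s = 0.
approx_a = approx_integral(lambda t: t ** -0.5, lambda a, x: x - a, 0.0, 0.3, -0.5, 0.0)
exact_a = 2.0 * math.sqrt(0.3)

# Example B (assumed): f(t) = exp(t), g = 1; K = f(0) = 1, r = 1, s = 0.
approx_b = approx_integral(math.exp, lambda a, x: x - a, 0.0, 0.1, 1.0, 0.0)
exact_b = math.exp(0.1) - 1.0

print(approx_a, exact_a)
print(approx_b, exact_b)
```

In Example A the integrand is a pure power, so the error terms $\epsilon_1, \epsilon_2$ vanish and the scheme reproduces the integral up to rounding; in Example B the scheme reduces to the midpoint rule.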


A CENTRAL FORCE EXAMPLE. Consider the simple central force problem in
which a mass on a spring is restricted to move in the vertical direction. Let the
spring potential be given by $U(y)$, where $y$ is the distance from the ground to the
mass. Then the dynamics are given by

\[ \ddot{y} = -g - \frac{DU(y)}{m}, \tag{13} \]

where $g$ is the acceleration due to gravity and $-DU(y)$ is the spring force; as a
matter of notation, $DU(y) = U'(y)$.

Since we assume no losses, the total energy is a constant of motion and we can
formulate the integral for time as

\[ t = \frac{1}{\sqrt{2g}} \int_{y_1}^{y_2} \frac{dy}{\sqrt{(y_0 - y) + \frac{1}{mg}\left(U(y_0) - U(y)\right)}}, \tag{14} \]

where $y_0$ is the location of the mass when the vertical velocity is zero.

Under reasonable assumptions on the spring potential, which will be made clear
in the following, we can approximate (14) using Observation 3.

Other problems, having integrals that resemble (14), could also be considered.
Examples are the integrals for swing time and swing angle of a rotating mass on a
spring or the integral for swing time of a simple pendulum. It is well known that
the solution of this latter example can be formulated as an elliptic integral and
thus several well-known approximation techniques can be applied. The results
presented in this Note could be used as well, but additionally, and more importantly,
they may also be applied if there are other forces, such as a torsional spring,
acting on the pendulum.

We focus on applying the results of Theorem 2 to our central force problem.
Suppose we desire to approximate the length of time it takes for the mass to move
from the point of maximal compression, $y_b$, to some other location, $y$. The result is
given by (14) with $y_0 = y_b$. Furthermore, suppose we wish to solve this problem
without assuming a particular functional form for the spring potential, $U(y)$. In
such a case, it is impossible to integrate (14) in closed form. However, (12) provides
an approximation to the integral of interest.

To apply Theorem 2, we need to factor the integrand of (14) into the product of 
two functions, f and g. This factorization is by no means unique and each choice 
results in a slightly different approximation to the integral. We illustrate two 
possible choices and discuss the advantages and disadvantages of each. 

Let us first consider

\[ g_1(y) = 1 \quad \text{and} \quad f_1(y) = \frac{1}{\sqrt{(y_b - y) + \frac{1}{mg}\left(U(y_b) - U(y)\right)}}. \]

In this case $s_1 = 0$. Choosing $K = 0$ and under reasonable assumptions on $U(y)$,
that is,

\[ \lim_{y \to y_b} \frac{U(y) - U(y_b)}{y - y_b} = U'(y_b) \text{ exists and is not equal to } -mg, \tag{15} \]

we find $r_1 = -1/2$. Note that the exception $U'(y_b) = -mg$ implies $\ddot{y}_b = 0$. Since
by definition $\dot{y}_b = 0$, this corresponds to an equilibrium of the system. From (12),
we have an approximation to the integral (14),

\[ t \approx \frac{y - y_b}{\sqrt{2g}\,\sqrt{(y_b - \hat{\xi}_1) + \frac{1}{mg}\left(U(y_b) - U(\hat{\xi}_1)\right)}}, \tag{16} \]

where $\hat{\xi}_1 = y_b + (1/4)(y - y_b)$.

Now consider

\[ g_2(y) = \frac{1}{\sqrt{y - y_b}} \quad \text{and} \quad f_2(y) = \frac{1}{\sqrt{-1 + \frac{1}{mg}\,\frac{U(y_b) - U(y)}{y - y_b}}}. \]

Here, we find $s_2 = -1/2$. If we assume (15), we can choose $K = \lim_{y \to y_b} f_2(y)$. In
order to solve for $r_2$, we need to apply L'Hospital's theorem. This, however, does
not provide enough information to determine $r_2$; we need to know more about
the structure of $U(y)$. If, for example, we assume

\[ \lim_{y \to y_b} \frac{U'(y)(y - y_b) - \left(U(y) - U(y_b)\right)}{(y - y_b)^{2}} = \tfrac{1}{2} U''(y_b) \ne 0 \text{ exists}, \tag{17} \]

we find $r_2 = 1$. If, however, $U''(y_b) = 0$, we must apply L'Hospital again and we
find that if

\[ \lim_{y \to y_b} \frac{U''(y)(y - y_b)^{2} - 2\left(U'(y)(y - y_b) - \left(U(y) - U(y_b)\right)\right)}{(y - y_b)^{3}} = \tfrac{1}{3} U'''(y_b) \ne 0 \text{ exists}, \]

then $r_2 = 2$.

Let us assume (17) holds; this implies $r_2 = 1$. In this case, using (12), we have
another approximation to the integral (14),

\[
t \approx \frac{1}{\sqrt{2g}}\,
\frac{1}{\sqrt{-1 + \frac{1}{mg}\,\frac{U(y_b) - U(\hat{\xi}_2)}{\hat{\xi}_2 - y_b}}}
\int_{y_b}^{y} \frac{dy}{\sqrt{y - y_b}}
= \frac{2\sqrt{y - y_b}}{\sqrt{2g}\,\sqrt{-1 + \frac{1}{mg}\,\frac{U(y_b) - U(\hat{\xi}_2)}{\hat{\xi}_2 - y_b}}},
\tag{18}
\]

where $\hat{\xi}_2 = y_b + (1/3)(y - y_b)$.
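
To get a feeling for how the two factorizations compare, here is a small Python sketch. It is only an illustration under assumptions that are not in the Note: a Hooke spring $U(y) = \tfrac{1}{2}k(y - \ell)^2$ with made-up values of $m$, $k$, $\ell$ and $y_b$, for which the travel time is known in closed form because the motion is harmonic about the equilibrium height. The function names are hypothetical.

```python
import math

# Illustrative values (assumptions, not from the Note): unit mass on a Hooke spring.
m, grav, k, rest = 1.0, 9.81, 100.0, 1.0
y_b = 0.8  # point of maximal compression, where the velocity vanishes

def U(y):
    """Assumed spring potential (Hooke's law with rest height `rest`)."""
    return 0.5 * k * (y - rest) ** 2

def t_first(y):
    """Choice g = 1 (s = 0, r = -1/2): mean value point y_b + (y - y_b)/4."""
    xi = y_b + 0.25 * (y - y_b)
    radicand = (y_b - xi) + (U(y_b) - U(xi)) / (m * grav)
    return (y - y_b) / (math.sqrt(2.0 * grav) * math.sqrt(radicand))

def t_second(y):
    """Choice g = (y - y_b)**(-1/2) (s = -1/2, r = 1): point y_b + (y - y_b)/3."""
    xi = y_b + (y - y_b) / 3.0
    radicand = -1.0 + (U(y_b) - U(xi)) / (m * grav * (xi - y_b))
    return 2.0 * math.sqrt(y - y_b) / (math.sqrt(2.0 * grav) * math.sqrt(radicand))

def t_exact(y):
    """Closed form for the Hooke spring: harmonic motion about the equilibrium."""
    omega = math.sqrt(k / m)
    y_eq = rest - m * grav / k
    amplitude = y_eq - y_b
    return math.acos((y_eq - y) / amplitude) / omega

y = y_b + 0.02
err_first = abs(t_first(y) - t_exact(y)) / t_exact(y)
err_second = abs(t_second(y) - t_exact(y)) / t_exact(y)
print(err_first, err_second)
```

For this particular potential and displacement the second factorization is noticeably more accurate, which fits the remark that it integrates the dominant behavior of $g$ exactly; other potentials may behave differently.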

In this approximation strategy, each approach has its own advantages and 
disadvantages. The second approach has the advantage of extracting the “domi- 
nant” behavior of g and integrating that exactly, but has the drawback of requiring 
more explicit knowledge of the spring potential law in order to calculate r. The 
first approach, while not offering the exactness of the second, allows greater 
flexibility because its approximation is based only upon the “uninformed” factor- 
ization (g(t) = 1), which allows r to be determined with only minimal knowledge 
of the spring potential law. Therefore, one’s choice of approximation depends on 
the application of interest. 

In this Note we provide two different approximations to (14). From Observation
3, we know that these approximations are good for $y$ close to $y_b$; however, our
proof shows that these approximations may be suitable over larger intervals.

Hopping Hoops Don’t Hop


James P. Butler


The dynamical behavior of a massless hoop with an attached point mass under the
influence of gravity is an old chestnut, with some surprising features, including the
question of whether or not it can “hop”. It appears in Littlewood’s Miscellany [1]
and most recently in this MONTHLY [2]. That there are many interesting aspects of
this problem was suggested to me many years ago by the late Prof. J.C. Miller of
Pomona College, who may have gotten a hint of the trickier parts directly from
Littlewood himself. The purpose of this Note is to show that Littlewood’s and
Tokieda’s asserted solution is wrong, even with a more realistic hoop and more
realistic friction, and to suggest some approaches to a self-consistent investigation
of the rolling hoop.

In its simplest form, the problem asks for the behavior of a rough massless hoop
of radius $R$ with a point mass $M$ attached to its rim, rolling in a vertical plane on a
level floor under gravity. Implicitly in Littlewood’s problem, and explicitly in this
Note, we understand the concept of massless to be the limit of a positive hoop
mass tending to 0; otherwise the rotational behavior of the hoop in free fall is
undefined. The idea of “rough” is central to the problem; it is (roughly) defined
as a no-slip constraint at the point of contact of the hoop and the floor. Let
$\theta \in [0, \pi]$ be the angle from the radius vector to the mass, measured from vertical;
other ranges of $\theta$ are not considered in this Note. If the total energy is equal to
the gravitational potential energy of the mass one diameter above the floor, the
“solution” to the problem is the assertion by Littlewood that the hoop “lifts off
the ground” or by Tokieda that the mass “pulls the hoop up” at $\theta = \pi/2$; this
particular value of $\theta$ depends on the total energy, but is especially simple in the
case cited. Denote the normal force conferred by the floor on the hoop by $n$. The
hopping conclusion is alleged to follow from the observation that, for zero kinetic
energy at $\theta = 0$ and for the rim constrained to be in no-slip contact with the floor,
$n > 0$ for $\theta < \pi/2$ and $n < 0$ for $\theta > \pi/2$. The evaluation of $n(\theta)$ is elementary
using Newton’s second law and conservation of energy (the rough condition is
conservative). An equivalent argument [2] is that the motion of the point mass
either follows a cycloid for $0 \le \theta \le \pi/2$, or its free fall parabolic preference for
$\pi/2 < \theta$; here the upper limit was not specified.

This solution is wrong on both mathematical and physical grounds. To show
this, we begin with the equations of motion, with the following notational conventions.
Nondimensionalize the problem by measuring distance in units of radius $R$,
mass in units of $M$, and time in units of $\sqrt{R/g}$, where $g$ is gravitational
acceleration. In these units, gravitational acceleration is $-1$. The horizontal and
vertical coordinates of the point mass are $x = \theta + \sin\theta$ and $y = 1 + \cos\theta$,
respectively. The kinetic energy is given by $(1/2)(\dot{x}^2 + \dot{y}^2) = \dot{\theta}^2 (1 + \cos\theta)$, where
the overdot denotes the time derivative, and the potential energy by $y = 1 + \cos\theta$.
With this notation, Newton’s law takes the form

\[ \ddot{y} = n - 1 = -\cos(\theta)\,\dot{\theta}^2 - \sin(\theta)\,\ddot{\theta}. \]

If we take the total energy to be 2 (the potential energy at $\theta = 0$), the conservation
of energy yields

\[ \dot{\theta}^2 (1 + \cos\theta) = 1 - \cos\theta. \]

From these two equations, it is easily verified that $n$ is positive for $\theta < \pi/2$, zero
for $\theta = \pi/2$, and negative for $\theta > \pi/2$ (in fact $n = \cos\theta$); since the floor cannot
exert a negative normal force on the hoop, one concludes (incorrectly) that the hoop
hops past $\theta = \pi/2$.
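
The sign pattern of the normal force is easy to confirm numerically. The Python sketch below is an illustration only (the function names are hypothetical): it takes $\dot{\theta}^2$ from the energy equation, obtains $\ddot{\theta} = \tfrac{1}{2}\,d(\dot{\theta}^2)/d\theta$ by a central difference, and evaluates $n$ from Newton's law in the nondimensional units above.

```python
import math

def theta_dot_sq(theta):
    """Energy conservation with total energy 2: theta_dot^2 = (1 - cos)/(1 + cos)."""
    return (1.0 - math.cos(theta)) / (1.0 + math.cos(theta))

def theta_ddot(theta, h=1e-6):
    """theta_ddot = (1/2) d(theta_dot^2)/d(theta), approximated by a central difference."""
    return (theta_dot_sq(theta + h) - theta_dot_sq(theta - h)) / (4.0 * h)

def normal_force(theta):
    """n = 1 - cos(theta) * theta_dot^2 - sin(theta) * theta_ddot (Newton's law for y)."""
    return (1.0
            - math.cos(theta) * theta_dot_sq(theta)
            - math.sin(theta) * theta_ddot(theta))

for theta in (0.3, math.pi / 4, math.pi / 2, 2.0, 2.5):
    print(theta, normal_force(theta), math.cos(theta))
```

Under the no-slip constraint the computed values agree with $\cos\theta$, so $n$ changes sign exactly at $\theta = \pi/2$.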


Theorem. Littlewood’s hoop doesn’t hop. 


Proof: Let the coordinates of the hoop’s center be $x_c, y_c$. For the hoop to hop, $y_c$
must be greater than one (radius) above the floor (as long as the hoop is in contact
with the floor, $y_c = y - \cos\theta = 1$). The proof that this cannot happen is by
contradiction. Assume that past $\theta = \theta_0 = \pi/2$ the hoop hops, implying that the
mass falls freely, then solve for $y_c$, and show that it cannot be greater than one.
Let $t_0$ be the time at which $\theta = \theta_0$ and $n = 0$. Assume that for $t_0 < t < t_1$, the
hoop hops, meaning $y_c > 1$. In this interval, the point mass is in free fall; the
equations of motion are different from those with rough contact and a positive
normal force. In particular, the vertical coordinate of the mass is given by
$y = y_0 + \dot{y}_0 (t - t_0) - (1/2)(t - t_0)^2$ and $\theta$ is given by uniform angular rotation
$\theta = \theta_0 + \dot{\theta}_0 (t - t_0)$, where the subscript 0 indicates the quantity evaluated at
$t = t_0$. During the hop, therefore,

\[ \dot{y}_c = \dot{y} + \sin(\theta)\,\dot{\theta} = \dot{y}_0 - (t - t_0) + \sin(\theta)\,\dot{\theta}_0, \]

\[ \ddot{y}_c = \ddot{y} + \cos(\theta)\,\dot{\theta}^2 + \sin(\theta)\,\ddot{\theta} = -1 + \cos(\theta)\,\dot{\theta}_0^2. \]

At the beginning of the hop, $y_c(t_0) = 1$, $\dot{y}_c(t_0) = 0$, and importantly, $\ddot{y}_c(t_0) = -1$.
It follows that for the open time interval $t_0 < t < t_1$, $\dot{y}_c < 0$. Over this interval
therefore, the velocity of the center of the hoop is negative, i.e., it must anti-hop,
or push through the floor. This is inconsistent with the assumption of a hop. ∎
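
The contradiction can be made concrete with a few numbers. The Python sketch below is only an illustration (variable and function names are hypothetical): it starts the assumed free fall at $\theta_0 = \pi/2$ with the values dictated by the rolling solution for total energy 2, and tabulates the height $y_c = y - \cos\theta$ of the hoop's center.

```python
import math

theta0 = math.pi / 2                      # angle at which n = 0 for total energy 2
theta_dot0 = 1.0                          # from theta_dot^2 (1 + cos) = 1 - cos at theta0
y0 = 1.0 + math.cos(theta0)               # height of the mass at t0
y_dot0 = -math.sin(theta0) * theta_dot0   # vertical velocity of the mass at t0

def center_height(tau):
    """y_c = y - cos(theta) a time tau after t0, assuming the mass is in free fall."""
    y = y0 + y_dot0 * tau - 0.5 * tau ** 2
    theta = theta0 + theta_dot0 * tau
    return y - math.cos(theta)

heights = [center_height(0.05 * i) for i in range(1, 21)]
print(heights[:5])
```

Every tabulated height lies below 1, so the center would have to sink into the floor rather than rise above it.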


What this theorem shows is that Littlewood’s problem is singular in the sense
that there is no solution past $t_0$ consistent with Newton’s laws, an impenetrable
floor ($y_c \ge 1$), and no-slip ($\dot{x}_c = \dot{\theta}$) when $n = 0$. What then does the real hoop
do? If we retain Newton’s laws and an impenetrable floor, then the no-slip
condition must be violated when $n = 0$, implying that the hoop skids. One must
check that the skidding solution is self-consistent; that this is true is sketched in the
following arguments.

There are several issues and questions raised by Littlewood’s problem. First,
with respect to the hopping conclusion, there is the explicit error in not using the
different equations of motion for the different periods. Equivalently, the idea of
simultaneously using a constraint such that $y_c = 1$ together with $n < 0$ to argue
that $y_c > 1$ is clearly inconsistent. Second, the concept of “rough” contact is not
well defined. As we have argued, we may retain a no-slip condition for two objects
in contact with $n > 0$, but not for $n = 0$; in what follows we define no-slip
conditions only for $n > 0$. Third, what then happens to such a real hoop during the
skid phase? Can it subsequently hop? Fourth, if we impose realistic frictional
conditions rather than rough ones, can such a hoop hop without skidding, or can it
hop following a skid phase?

It is not the purpose of this Note to give complete answers to these questions,
but a few points can be made. The idea of a point mass and a massless hoop
suggests a second type of singular character to the equations of motion. This
singularity manifests itself as a reduction of the number of equations of motion
from three to two when the moment of inertia about the center of mass is zero,
and as a concomitant reduction in the number of degrees of freedom for the
frictional force from two to one (the vector friction force must point in the
direction of the point mass). In approximating the behavior of real hoops, it is thus
appropriate to consider distributing the mass on a real hoop so that its center of
mass is at a radial distance $\lambda < 1$ and its moment of inertia about that point is
$I > 0$; we must investigate the hoop’s behavior for $\lambda$ near 1 and for $I$ near zero.
With coordinates $x, y$ generalized to the center of mass ($x = x_c + \lambda\sin\theta$,
$y = y_c + \lambda\cos\theta$) and denoting the tangential frictional force of the floor on the hoop by
$f$, the equations of motion, valid irrespective of no-slip, skidding, or free-fall
conditions, are

\[ \ddot{x} = f, \]
\[ \ddot{y} = n - 1, \]
\[ I\ddot{\theta} = n(x - x_c) - f y = n\lambda\sin\theta - f(1 + \lambda\cos\theta). \]

The no-slip condition remains $\dot{x}_c = \dot{\theta}$. It turns out that in this case as well, our
Theorem remains true. The proof by contradiction is similar, and may be sketched
as follows. When the hoop hops, we have $n(t_0) = 0$, $\ddot{y}(t_0) = -1$,
$\ddot{y}_c(t_0^-) = -1 + \lambda\cos(\theta_0)\dot{\theta}_0^2 + \lambda\sin(\theta_0)\ddot{\theta}_0 = 0$, and
$\ddot{y}_c(t_0^+) = -1 + \lambda\cos(\theta_0)\dot{\theta}_0^2$. Eliminating $\ddot{x}$
from the equations of motion (written in terms of $\theta$) easily shows that $f < 0$ and
$\ddot{\theta} > 0$ when $n = 0$. Comparison of $\ddot{y}_c(t_0^-)$ with $\ddot{y}_c(t_0^+)$ then shows $\ddot{y}_c(t_0^+) < 0$,
which as before implies a contradictory anti-hop and therefore a skid. That the
skid phase is well defined is shown by the existence of a finite $f$ consistent with
keeping the hoop on the floor.

Having established that (at least for $\theta \in [0, \pi]$) an ideal hoop and its real
cousin must skid if the normal force goes to zero, we now ask for a little more
reality in the friction law. This might be Coulomb friction, with skidding occurring
at some critical ratio of the tangential and normal force. One might even include a
distinction between the ratio at which skidding begins (static friction), and the ratio
during the skid (dynamic friction). In any case, the physics is no longer conservative,
and the absence of an energy integral means the equations are more difficult
to analyze. Nevertheless, some general conclusions may still be drawn. (A) Is the
skid forward ($\dot{\theta} > \dot{x}_c$: hoop is rotating faster than necessary for no-slip (an
accelerating train with slipping wheels)), or backwards ($\dot{\theta} < \dot{x}_c$: hoop is rotating
slower than required for no-slip (a decelerating train with slipping wheels))? The
answer is that both behaviors are possible. Starting from $\theta$ near zero, the initial
tangential force is clearly positive, and if the hoop is sufficiently greased, $f/n$ can
match the static friction; the hoop skids forward. If it’s not, there is necessarily a
backwards skid with $f < 0$. The sign of $f$ at the beginning of the skid must
therefore be considered. (B) While skidding, is a hop possible at some $t_1$? There
are several possibilities here. (B1) Call a hop a “smooth take-off” if it occurs when
$n = 0$. During the skid $|f/n|$ is fixed for Coulomb friction, and thus a smooth
take-off requires $f = 0$ also. In this case (by analogy with the sketched proof),
$\ddot{\theta}(t_1) = 0$, $\ddot{y}_c(t_1^-) = \ddot{y}_c(t_1^+) = 0$, but $\dddot{y}_c(t_1^+) = -\lambda\sin(\theta_1)\dot{\theta}_1^3 < 0$, again implying an
anti-hop. (B2) Call a hop a “semi-smooth take-off” if it occurs when the skidding
hoop catches up to the floor, and dynamic friction switches over to static friction.
That this additional complication doesn’t change matters is easily seen by the
observation that a change in force to a different, but finite, value for an infinitesimal
amount of time cannot change the subsequent dynamics. (B3) What remains is
the possibility of a “jump hop”, associated with impulsive forces, approximated by
delta functions in “stick-slip” dynamical problems such as chattering chalk on a
blackboard. This phenomenon appears to be the most likely candidate for the
origin of the real hop of real hoops, but raises more complicated questions about
the physics, and makes the analysis commensurately more difficult.

This analysis of the hopping hoop leads to several conclusions. First, and most
important, is that Newton’s laws and the kinematical constraint for “rough” contact
are in general inconsistent when the normal force is zero. Second, real hoops that
hop must skid first, and the subsequent hop can be neither smooth nor semi-smooth.
Third, there is a rich structure in the behavior of real hoops: vary $\lambda$ and $I$, vary the
initial conditions, let $\theta$ be unbounded, follow the bounce(s) after the hop. Finally,
with respect to this isolated singularity in Mr. Littlewood’s Miscellany, he did say
that in practice, “the hoop skids”, but seemed to imply this to be due to a realistic
friction law rather than a necessary consequence even with an unbounded coefficient
of static friction. The answer to his query whether the behavior of the hoop is
intuitive is given by the following


Theorem. The behavior of hopping hoops is not intuitive.
Proof: By inspection.

Approximation of Hölder Continuous
Functions by Bernstein Polynomials


Peter Mathé 


In a recent MONTHLY [5], a special instance of the Weierstraß approximation
theorem attracted attention: approximation of real Lipschitz functions on [0, 1] by
Bernstein polynomials

\[ B_n(f, x) := \sum_{j=0}^{n} \binom{n}{j} x^{j} (1 - x)^{n-j} f\!\left(\frac{j}{n}\right). \]

The authors of [5] provided a rate of uniform convergence of $B_n(f, \cdot)$ to $f$ using
large deviations techniques. It is the aim of this note to discuss the optimal rate of
approximation with some historical remarks. More generally we consider the class
$\mathrm{Lip}_\alpha(L)$ of Hölder continuous functions with exponent $\alpha$ for some $0 < \alpha \le 1$ and
constant $L$, i.e., functions that obey

\[ |f(x) - f(y)| \le L |x - y|^{\alpha} \quad \text{for all } x, y \in [0, 1]. \]

We present two simple proofs of the following


Theorem 1. If the function $f : [0, 1] \to \mathbb{R}$ is Hölder continuous with exponent $\alpha$ and
constant $L$ then

\[ |f(x) - B_n(f, x)| \le L \left( \frac{x(1 - x)}{n} \right)^{\alpha/2} \quad \text{for all } n \in \mathbb{N} \text{ and all } x \in [0, 1]. \tag{1} \]


In contrast to Gzyl and Palacios [5], who refine the original Bernstein argument
to treat the specialization to Lipschitz functions, we use more direct arguments
that are elementary and improve the rate to the optimal one.

The history of this result is worth recounting. Bernstein first used his eponymous
polynomials to prove the Weierstraß approximation theorem in 1912 [2]. It
took more than twenty years before results concerning the rate of convergence of
$B_n(f, \cdot)$ to $f$ appeared, by Popoviciu [9] and by Kac ([6] and [7]). While Popoviciu
established speed of convergence in terms of the modulus of continuity, Kac
originally proved exactly Theorem 1. The first proof we present is in the spirit of
Kac and exploits the probabilistic meaning of the Bernstein polynomials. The
second version uses Korovkin-type arguments [1], as in the nice exposition of the
Weierstraß approximation theorem in [10, Chapter 1.2]. Other approaches are
reviewed in Lorentz’s monograph [8].

The probabilistic nature of the Bernstein polynomials can be recognized when
interpreting the weights $\binom{n}{j} x^{j} (1 - x)^{n-j}$ as point probabilities of a binomial
distribution with parameters $n$ and $x$. There are, however, several ways to realize
this distribution. To establish a connection with empirical distribution functions,
given $n$, let $u_1, \ldots, u_n$ be random variables that are independent and uniformly
distributed on [0, 1]. Consider the random function $S_n : [0, 1] \to [0, 1]$ defined by

\[ S_n(x) := \frac{1}{n} \sum_{j=1}^{n} \chi_{[0, x)}(u_j), \quad x \in [0, 1], \]

where $\chi_{[0, x)}$ denotes the characteristic function of the interval $[0, x)$. $S_n(x)$ takes only
the values $j/n$, $j = 0, \ldots, n$, with probabilities $P(S_n(x) = j/n) = \binom{n}{j} x^{j} (1 - x)^{n-j}$.
Thus at any point $x$ the random function value $nS_n(x)$ is binomially distributed
and we have expectation $E S_n(x) = x$ and variance

\[ E(S_n(x) - x)^{2} = \frac{x(1 - x)}{n}. \tag{2} \]


nN 


Proof of Theorem 1: probabilistic version. By construction $B_n(f, x) = E f(S_n(x))$.
Using the triangle inequality and Hölder continuity we obtain

\[ |f(x) - B_n(f, x)| \le E|f(x) - f(S_n(x))| \le L\,E|x - S_n(x)|^{\alpha}. \]

Apply the Hölder inequality with parameters $2/\alpha$ and $2/(2 - \alpha)$, and arrive at

\[ |f(x) - B_n(f, x)| \le L \left( E|x - S_n(x)|^{2} \right)^{\alpha/2}. \]

Finally, use the variance formula given in (2) to obtain (1). ∎
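
As a sanity check of (1), the following Python sketch evaluates the Bernstein polynomial directly and compares the error with the bound on a grid. It is an illustration under assumptions made here, not in the Note: the test function $f(y) = |y - 1/2|^{\alpha}$ with $\alpha = 1/2$ and $L = 1$, the grid size, and the helper names are all choices of this sketch.

```python
import math

def bernstein(f, n, x):
    """B_n(f, x) = sum_j C(n, j) x^j (1 - x)^(n - j) f(j / n)."""
    return sum(math.comb(n, j) * x ** j * (1.0 - x) ** (n - j) * f(j / n)
               for j in range(n + 1))

alpha, L = 0.5, 1.0

def f(y):
    # |y - 1/2|^alpha is Hoelder continuous with exponent alpha and constant 1.
    return abs(y - 0.5) ** alpha

def worst_slack(n, grid=101):
    """Smallest value of (bound - error) over a uniform grid of x in [0, 1]."""
    slack = []
    for i in range(grid):
        x = i / (grid - 1)
        error = abs(f(x) - bernstein(f, n, x))
        bound = L * (x * (1.0 - x) / n) ** (alpha / 2)
        slack.append(bound - error)
    return min(slack)

for n in (1, 5, 20, 100):
    print(n, worst_slack(n))
```

A nonnegative slack (up to rounding) for each $n$ is what (1) predicts; for $n = 1$ and $x = 1/2$ this test function attains the bound with equality.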


Proof of Theorem 1: analytic version. Here we consider the mapping

\[ f \in C([0, 1]) \mapsto B_n(f, \cdot) \in C([0, 1]), \]

which is a bounded linear operator that maps nonnegative functions into nonnegative
polynomials (and thus is positive). Moreover, letting $p_0(x) = 1$, $p_1(x) = x$,
$p_2(x) = x^2$ denote the monomials of degree at most 2, it is easy to check that

\[ B_n(p_0, x) = 1, \quad B_n(p_1, x) = x, \quad B_n(p_2, x) = x^{2} + x(1 - x)/n. \tag{2.5} \]

Denote $f_x(y) := L|x - y|^{\alpha}$. Thus for any function $f \in \mathrm{Lip}_\alpha(L)$ we have
$-f_x(y) \le f(x) - f(y) \le f_x(y)$ for all $y \in [0, 1]$. Applying $B_n$ and using its positivity
we obtain $|f(x) - B_n(f, y)| \le B_n(f_x, y)$. Hence, for $y = x$ we arrive at

\[ |f(x) - B_n(f, x)| \le B_n(f_x, x). \]

As a substitute for the Hölder inequality the operator $B_n$ obeys

\[ B_n(|g|, x) \le \left( B_n(|g|^{2/\alpha}, x) \right)^{\alpha/2} \tag{3} \]

for any function $g$. Indeed, the geometric-arithmetic mean inequality

\[ \sigma^{1/p} \tau^{1 - 1/p} \le \frac{1}{p}\,\sigma + (1 - 1/p)\,\tau, \quad p > 1,\ \sigma, \tau \ge 0, \]

with $p = 2/\alpha$, $\tau = \left( B_n(|g|^{2/\alpha}, x) \right)^{\alpha/2}$, and $\sigma = |g(y)|^{2/\alpha}\,\tau^{1 - 2/\alpha}$ for $y \in [0, 1]$
yields

\[ |g(y)| \le \frac{1}{p}\,|g(y)|^{2/\alpha} \left( B_n(|g|^{2/\alpha}, x) \right)^{\alpha/2 - 1} + \frac{p - 1}{p} \left( B_n(|g|^{2/\alpha}, x) \right)^{\alpha/2}. \]

Using the positivity of $B_n$ we finally arrive at

\[
B_n(|g|, x) \le \frac{1}{p} \left( B_n(|g|^{2/\alpha}, x) \right)^{\alpha/2} + (1 - 1/p) \left( B_n(|g|^{2/\alpha}, x) \right)^{\alpha/2}
= \left( B_n(|g|^{2/\alpha}, x) \right)^{\alpha/2},
\]

which proves (3). Thus, for $g := f_x$ we obtain

\[ |f(x) - B_n(f, x)| \le B_n(f_x, x) \le \left( B_n(f_x^{2/\alpha}, x) \right)^{\alpha/2}. \tag{3.5} \]

Using the linearity of $B_n$ and its values at $p_0$, $p_1$, and $p_2$ given in (2.5), and
observing that $f_x^{2/\alpha}(y) = L^{2/\alpha}(x^2 - 2xy + y^2)$, we conclude that
$B_n(f_x^{2/\alpha}, x) = L^{2/\alpha} x(1 - x)/n$. Substituting this equality into (3.5) completes the proof. ∎


For completeness we derive the asymptotically exact behavior of the error. The
accuracy of the approximation of Hölder continuous functions by Bernstein polynomials
is quantified by

\[ e_n(L, \alpha) := \sup_{f \in \mathrm{Lip}_\alpha(L)}\ \sup_{x \in [0, 1]} |f(x) - B_n(f, x)|. \]

Theorem 1 can thus be rewritten as $e_n(L, \alpha) \le L(1/(4n))^{\alpha/2}$. This is refined in

Theorem 2. $\lim_{n \to \infty} n^{\alpha/2} e_n(L, \alpha) = L\,2^{-\alpha/2}\,\Gamma((\alpha + 1)/2)/\sqrt{\pi}$.
Kac proved that the rate of convergence cannot be improved on the class of
Hölder continuous functions by establishing an appropriate lower bound for the
specific function $f_{1/2} \in \mathrm{Lip}_\alpha(L)$. For this particular choice we have

\[ e_n(L, \alpha) \ge |f_{1/2}(1/2) - B_n(f_{1/2}, 1/2)| = L\,E|S_n(1/2) - 1/2|^{\alpha}. \tag{4} \]

Popoviciu derived the exact asymptotics of the mean absolute deviation
$E|S_n(x) - x|$, i.e., the case $\alpha = 1$, by providing the explicit representation

\[ E|S_n(x) - x| = 2 \binom{n - 1}{j} x^{j+1} (1 - x)^{n-j}, \tag{5} \]

where $j$ is the unique integer for which $x \in [j/n, (j + 1)/n)$. This representation
has a long history; see [4] for a proof and historical details, beginning in 1730,
when De Moivre gave a similar explicit formula for the case $x = 1/2$. The
derivation of (5) is combinatorial and the analysis of the asymptotic behavior is
non-elementary. Our arguments do not rely on the representation (5). Instead we
again try to keep close to probabilistic arguments and use the De Moivre-Laplace
Theorem on the normal approximation of the binomial distribution.
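
The representation (5) can be checked against a direct summation over the binomial point probabilities. The Python sketch below is an illustration only; the function names and the sample values of $n$ and $x$ are choices made here, and the closed form is written with $j = \lfloor nx \rfloor$, which is the integer described after (5).

```python
import math

def mean_abs_deviation(n, x):
    """E|S_n(x) - x| summed directly over the binomial point probabilities."""
    return sum(math.comb(n, i) * x ** i * (1.0 - x) ** (n - i) * abs(i / n - x)
               for i in range(n + 1))

def closed_form(n, x):
    """Right-hand side of (5): 2 C(n-1, j) x^(j+1) (1-x)^(n-j) with j = floor(n x)."""
    j = math.floor(n * x)
    return 2.0 * math.comb(n - 1, j) * x ** (j + 1) * (1.0 - x) ** (n - j)

for n in (1, 2, 3, 10, 37):
    for x in (0.1, 0.37, 0.5, 0.93):
        print(n, x, mean_abs_deviation(n, x), closed_form(n, x))
```

The two columns agree to rounding error for the sampled values, which is consistent with (5) but of course no substitute for the combinatorial proof in [4].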

Our proof of Theorem 2 requires basic knowledge of probability theory and real
analysis. For convenience we split the proof of Theorem 2 into two lemmas. As the
lower bound in (4) suggests we are concerned with the asymptotic behavior of
$n^{\alpha/2} E|S_n(1/2) - 1/2|^{\alpha}$, and more generally we study the functions
$g_n(x) := n^{\alpha/2} E|S_n(x) - x|^{\alpha}$. Given $\alpha$ we define $K_\alpha := 2^{\alpha/2}\,\Gamma((\alpha + 1)/2)/\sqrt{\pi}$ and
$g(x) := (x(1 - x))^{\alpha/2} K_\alpha$.


Lemma 1. $\lim_{n \to \infty} g_n(x) = g(x)$ for every $x \in [0, 1]$.

Moreover the functions $g_n$ belong to $\mathrm{Lip}_{\alpha/2}(1)$ as can be seen from

Lemma 2. $|g_n(x) - g_n(y)| \le |x - y|^{\alpha/2}$ for all $n \in \mathbb{N}$ and all $x, y \in [0, 1]$.


Suppose for a moment that both lemmas have been proved. Pointwise convergence
from Lemma 1 and uniform Hölder continuity from Lemma 2 imply uniform
convergence as stated in


Proposition 1. $\lim_{n \to \infty} \sup_{x \in [0, 1]} |g_n(x) - g(x)| = 0$.


Proof: Our arguments follow the usual proof of the Arzelà-Ascoli Theorem. In
fact, for any $m \in \mathbb{N}$ the set $\{j/m,\ j = 0, \ldots, m\}$ is finite so that Lemma 1 provides
some $n_0$ such that

\[ |g_n(j/m) - g(j/m)| \le \left( \frac{1}{2m} \right)^{\alpha/2}, \quad j = 0, \ldots, m,\ n \ge n_0. \]

Thus for an arbitrary $x \in [0, 1]$ let $j_x$ be chosen such that $|x - j_x/m| \le 1/(2m)$.
Then we conclude, using Lemma 2,

\[
|g_n(x) - g(x)| \le |g_n(x) - g_n(j_x/m)| + |g_n(j_x/m) - g(j_x/m)| + |g(j_x/m) - g(x)|
\le 3 \left( \frac{1}{2m} \right)^{\alpha/2}
\]

for $n \ge n_0$ and all $x \in [0, 1]$. Since $m$ may be chosen arbitrarily large we have
established uniform convergence. ∎


Now the proof of Theorem 2 follows easily.

Proof of Theorem 2. We infer from (4) that $n^{\alpha/2} e_n(L, \alpha) \ge L g_n(1/2)$. Thus Lemma
1 yields $\liminf_{n \to \infty} n^{\alpha/2} e_n(L, \alpha) \ge L g(1/2)$. On the other hand, interchanging
$\sup_{f \in \mathrm{Lip}_\alpha(L)}$ and $\sup_{x \in [0, 1]}$ in the definition of $e_n(L, \alpha)$ gives

\[
n^{\alpha/2} e_n(L, \alpha) = n^{\alpha/2} \sup_{x \in [0, 1]}\ \sup_{f \in \mathrm{Lip}_\alpha(L)} |f(x) - B_n(f, x)|
\le L\,n^{\alpha/2} \sup_{x \in [0, 1]} E|x - S_n(x)|^{\alpha} = L \sup_{x \in [0, 1]} g_n(x).
\]

Since

\[ \Bigl| \sup_{x \in [0, 1]} g_n(x) - \sup_{x \in [0, 1]} g(x) \Bigr| \le \sup_{x \in [0, 1]} |g_n(x) - g(x)|, \]

Proposition 1 implies that $\sup_{x \in [0, 1]} g_n(x) \to \sup_{x \in [0, 1]} g(x)$ as $n \to \infty$. Thus
$\limsup_{n \to \infty} n^{\alpha/2} e_n(L, \alpha) \le L \sup_{x \in [0, 1]} g(x) = L g(1/2)$. ∎


We turn to the proofs of the lemmas.


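Proof of Lemma 1. Since the random variable $nS_n(x)$ is binomially distributed for
every $x$, we may apply the De Moivre-Laplace Theorem, which asserts that

\[ \lim_{n \to \infty} P\left( a < \frac{nS_n(x) - nx}{\sqrt{nx(1 - x)}} < b \right) = \frac{1}{\sqrt{2\pi}} \int_a^b e^{-y^2/2}\,dy \]

for all $a < b$. Especially for all $t > 0$ this yields

\[ \lim_{n \to \infty} P\left( \left| \frac{nS_n(x) - nx}{\sqrt{nx(1 - x)}} \right| > t \right) = \frac{2}{\sqrt{2\pi}} \int_t^\infty e^{-y^2/2}\,dy. \tag{6} \]

Furthermore, we may use Chebyshev’s inequality to conclude that

\[ P\left( \left| \frac{nS_n(x) - nx}{\sqrt{nx(1 - x)}} \right| > t \right) \le \frac{1}{t^2}, \]

hence

\[ P\left( \left| \frac{nS_n(x) - nx}{\sqrt{nx(1 - x)}} \right| > t \right) \le \min\{1, t^{-2}\}. \tag{7} \]

We recall that $g_n(x) = n^{\alpha/2} E|S_n(x) - x|^{\alpha}$. Stieltjes integration allows us to
compute

\[
g_n(x) = \alpha \int_0^\infty t^{\alpha - 1} P\left( \sqrt{n}\,|S_n(x) - x| > t \right) dt
= (x(1 - x))^{\alpha/2}\,\alpha \int_0^\infty t^{\alpha - 1} P\left( \left| \frac{nS_n(x) - nx}{\sqrt{nx(1 - x)}} \right| > t \right) dt.
\]

Since $\int_0^\infty t^{\alpha - 1} \min\{1, t^{-2}\}\,dt < \infty$ and we have pointwise convergence as in (6), the
Lebesgue Dominated Convergence Theorem implies

\[
\begin{aligned}
\lim_{n \to \infty} g_n(x) &= \lim_{n \to \infty} \alpha \int_0^\infty t^{\alpha - 1} P\left( \sqrt{n}\,|S_n(x) - x| > t \right) dt \\
&= (x(1 - x))^{\alpha/2}\,\alpha \int_0^\infty t^{\alpha - 1}\,\frac{2}{\sqrt{2\pi}} \int_t^\infty e^{-s^2/2}\,ds\,dt \\
&= (x(1 - x))^{\alpha/2}\,\frac{2}{\sqrt{2\pi}} \int_0^\infty y^{\alpha} e^{-y^2/2}\,dy \\
&= (x(1 - x))^{\alpha/2}\,\frac{2^{\alpha/2}\,\Gamma((\alpha + 1)/2)}{\sqrt{\pi}} = g(x). \qquad ∎
\end{aligned}
\]

It is evident from the proof that $K_\alpha$ is the $\alpha$th absolute moment of the standard
normal distribution, i.e., $K_\alpha = E|\gamma|^{\alpha}$, where $\gamma$ is standard normal. This could also
be guessed, since the De Moivre-Laplace Theorem may be viewed as a special
instance of the Central Limit Theorem.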
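
The limit in Lemma 1 at $x = 1/2$, which is the constant appearing in Theorem 2 (with $L = 1$), can be observed numerically. The Python sketch below is an illustration only: the function names are hypothetical, the choice $n = 2000$ is arbitrary, and the binomial weights are formed as exact integer ratios so that very small probabilities do not cause arithmetic errors.

```python
import math

def scaled_error(n, alpha):
    """n^(alpha/2) * E|S_n(1/2) - 1/2|^alpha, with exact binomial weights C(n, j) / 2^n."""
    total = sum(math.comb(n, j) / 2 ** n * abs(j / n - 0.5) ** alpha
                for j in range(n + 1))
    return n ** (alpha / 2) * total

def limit_value(alpha):
    """g(1/2) = (1/4)^(alpha/2) K_alpha = 2^(-alpha/2) Gamma((alpha+1)/2) / sqrt(pi)."""
    return 2.0 ** (-alpha / 2) * math.gamma((alpha + 1) / 2) / math.sqrt(math.pi)

for alpha in (0.5, 1.0, 2.0):
    print(alpha, scaled_error(2000, alpha), limit_value(alpha))
```

For $\alpha = 2$ both quantities equal $1/4$ for every $n$, since $n\,E(S_n(1/2) - 1/2)^2 = 1/4$ by (2); for $\alpha = 1$ the limit is $1/\sqrt{2\pi}$.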


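Proof of Lemma 2. Since $\bigl| |a|^{\alpha} - |b|^{\alpha} \bigr| \le |a - b|^{\alpha}$ for all $a, b \in \mathbb{R}$ and $0 < \alpha \le 1$,
we have

\[
\begin{aligned}
|g_n(x) - g_n(y)| &\le n^{\alpha/2} E\left| (S_n(x) - x) - (S_n(y) - y) \right|^{\alpha} \\
&\le n^{\alpha/2} \left( E\left| (S_n(x) - x) - (S_n(y) - y) \right|^{2} \right)^{\alpha/2} \\
&= \left( n \operatorname{Var}(S_n(x) - S_n(y)) \right)^{\alpha/2}.
\end{aligned}
\]

Now suppose $x < y$ and recall that $S_n(x) - S_n(y) = -\frac{1}{n} \sum_{j=1}^{n} \chi_{[x, y)}(u_j)$, which is
the sum of $n$ independent random variables. It is well known that then the
variance of the sum equals the sum of the variances, which implies

\[
|g_n(x) - g_n(y)| \le \left( \frac{1}{n} \sum_{j=1}^{n} \operatorname{Var}\left( \chi_{[x, y)}(u_j) \right) \right)^{\alpha/2}
= \left( \operatorname{Var}\left( \chi_{[x, y)}(u_1) \right) \right)^{\alpha/2}
= \left( (y - x)(1 - (y - x)) \right)^{\alpha/2} \le |y - x|^{\alpha/2}. \qquad ∎
\]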


We have thus proved the lemmas and hence Theorem 2. We admit that deriving
the exact asymptotics of the error is much more elaborate than the simple
arguments leading to Theorem 1. The gain in accuracy is less than $\sqrt{2/\pi}$.

To complete our development of properties of the Bernstein polynomials using
the probabilistic representation, we add the following result, originally proved by
Brown, Elliott, and Paget [3].


Proposition 2. If the function $f : [0, 1] \to \mathbb{R}$ is Hölder continuous with exponent $\alpha$
and constant $L$ then so are the corresponding Bernstein polynomials $B_n(f, \cdot)$.


Proof: Use the triangle inequality, Hölder continuity, the representation of $S_n(x)$
as an empirical distribution function, and finally the Hölder inequality to derive for
$x < y$ the estimates

\[
\begin{aligned}
|B_n(f, x) - B_n(f, y)| &\le E|f(S_n(x)) - f(S_n(y))| \le L\,E|S_n(x) - S_n(y)|^{\alpha} \\
&= L\,E\left| \frac{1}{n} \sum_{j=1}^{n} \chi_{[x, y)}(u_j) \right|^{\alpha}
\le L \left( E\,\frac{1}{n} \sum_{j=1}^{n} \chi_{[x, y)}(u_j) \right)^{\alpha} \\
&\le L|x - y|^{\alpha}. \qquad ∎
\end{aligned}
\]

This proof is brief and elementary, but heavily uses the specific realization of the
$S_n(x)$ through an empirical distribution function, thereby correlating the random
variables $S_n(x)$ and $S_n(y)$ properly. This was also crucial for proving Lemma 2.
The simple arguments leading to Theorem 1 did not rely on any specific realization.

An Extension of the Wallace-Simson Theorem:
Projecting in Arbitrary Directions 


Miguel de Guzman 


1. THE WALLACE-SIMSON LINE. The Wallace line has been a popular object
of study for many geometers during the past two centuries. Let us start by recalling
the theorem.


The Wallace-Simson Theorem. Consider a triangle ABC. The locus of all those points
P in its plane such that the orthogonal projections of P on the three sides of the triangle
are collinear is the circumcircle of ABC. The line of the projections is called the
Wallace-Simson line of P with respect to ABC.

Poncelet attributed this theorem to Robert Simson (1687-1768), among others,
since it had the flavor of the geometrical properties that Simson was interested in.
Almost certainly it is actually due to Wallace, another Scottish mathematician of
lesser importance, who discovered it in 1799.

The beauty of this theorem, which shows a somewhat unexpected and surprising 
relationship between a triangle and its circumcircle, attracted many geometers in 
the nineteenth century, among others Jakob Steiner, and led to the discovery of 
many beautiful properties. Among the most surprising are those connected with 
Feuerbach’s circle, Steiner’s deltoid, and Morley’s triangle. A reader interested in 
making an excursion through this landscape can find a guide in the references at 
the end of this article. Particularly interesting for its numerous references to the 
older literature is F.G.-M.’s work [7, p. 329]. The initials F.G.-M correspond to 
Fr. Gabriel-Marie, who signed his published works this way. 

It is rather simple to obtain a direct proof of the Wallace theorem. One has just
to prove that, in Figure 1, one has ∠BVU = ∠AVW. Observing that quadrilateral
PVUB is cyclic because of the right angles at U and V, one gets ∠BVU = ∠BPU
= 90° − ∠PBU = 90° − ∠PBC. In the same way PVAW is also cyclic and therefore
∠AVW = ∠APW = 90° − ∠PAW = 90° − (180° − ∠PAC) = ∠PAC − 90°. But it is
quite clear that ∠PAC − 90° = 90° − ∠PBC since the angles at A and B are
supplementary in cyclic quadrilateral PBCA, and so ∠BVU = ∠AVW, which shows
that U, V, and W are collinear. ∎


Figure 1
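The collinearity statement can be checked numerically. The following is a minimal sketch, not part of the original article: the helper names (`foot`, `signed_area`, `pedal_area`) and the sample triangle are illustrative assumptions. It computes the oriented area of the triangle of orthogonal projections and shows that it vanishes, up to rounding, for a point on the circumcircle but not for a point off it.

```python
import math

def foot(p, a, b):
    """Orthogonal projection of the point p onto the line through a and b."""
    dx, dy = b[0] - a[0], b[1] - a[1]
    t = ((p[0] - a[0]) * dx + (p[1] - a[1]) * dy) / (dx * dx + dy * dy)
    return (a[0] + t * dx, a[1] + t * dy)

def signed_area(u, v, w):
    """Oriented area of triangle uvw; zero exactly when u, v, w are collinear."""
    return 0.5 * ((v[0] - u[0]) * (w[1] - u[1]) - (w[0] - u[0]) * (v[1] - u[1]))

def pedal_area(p, A, B, C):
    """Oriented area of the triangle of projections of p on BC, CA, AB."""
    return signed_area(foot(p, B, C), foot(p, C, A), foot(p, A, B))

# A triangle inscribed in the unit circle (the angles are arbitrary choices).
A, B, C = [(math.cos(t), math.sin(t)) for t in (0.3, 2.1, 4.4)]
on_circle = (math.cos(1.0), math.sin(1.0))
off_circle = (0.5, 0.2)

print(pedal_area(on_circle, A, B, C))   # numerically zero
print(pedal_area(off_circle, A, B, C))  # clearly nonzero
```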


2. AN EXTENSION OF THE THEOREM. The goal of this Note is to present a 
generalization of the Wallace theorem along a strikingly simple direction that I 
have never seen explored. I first state the theorem and discuss its proof and then 
present some nice exercises that can be solved in a simple way. 

Theorem. Consider a triangle ABC. Let us call a, b, c the corresponding sides
opposite the vertices. We fix three projection directions α, β, γ, not all three equal, and
such that α is not parallel to side a, β is not parallel to side b, and γ is not parallel to
side c. Take an arbitrary point P in the plane of ABC and project it on a along α
obtaining U, on b along β obtaining V, and on c along γ obtaining W as in Figure 2.
Fix a real number k and an orientation in the plane in order to give a sign to the areas
of the triangles we consider.


Figure 2 


Then the locus of all points P such that the oriented triangle UVW has area k is a
conic C(k). It is clear that C(0) always goes through the three vertices A, B, C (for
example, with P at A, V and W are also at A). The conic C(k) can, of course,
degenerate in various ways.

When k varies (with fixed α, β, γ), the family C(k) is always a family of conics
with the same points at infinity. Furthermore, if one of them has a center, all others
have the same center and are homothetic to each other (except the possibly degenerate
elements of the family C(k)), the center of homothety being the common center of all
such conics. If none has a center, then they are all translations of the same parabola
along the direction of its axis.

The construction, with straightedge and compass, of the common center (when it
exists) and of the axes and asymptotes of the conics of the family C(k) is easily done
once one knows A, B, C and the projection directions α, β, γ.


Proof: The theorem is one of those results whose only difficulty is arriving at its 
statement, since the easy analytical proof we develop could be left as an exercise to 
the reader. 


Let us begin by fixing an arbitrary cartesian system of coordinates and an
orientation to give a sign to the areas of the triangles. If the point P has
coordinates (x, y) and we denote by α, β, γ three vectors that correspond to the
given projection directions, it is clear that the points U, V, W have as coordinates,
respectively,

$$(u_1(x,y),\, u_2(x,y)), \quad (v_1(x,y),\, v_2(x,y)), \quad (w_1(x,y),\, w_2(x,y)),$$

where each of these functions is a linear function in x, y with coefficients that
depend only on the parameters (already fixed) that determine A, B, C, α, β, γ in
our coordinate system.

The area of the triangle UVW is given by half the value of the determinant of
the matrix

$$\begin{pmatrix} u_1(x,y) & u_2(x,y) & 1 \\ v_1(x,y) & v_2(x,y) & 1 \\ w_1(x,y) & w_2(x,y) & 1 \end{pmatrix}.$$

Therefore the equation of the locus is of the form

$$mx^2 + ny^2 + 2pxy + 2qx + 2ry + s = 2k \tag{1}$$


where m, n, p, q, r, s depend only on the fixed entities A, B, C, α, β, γ. This
shows that C(k) is a conic and that, when k varies, C(k) is a family of conics
whose points at infinity are the same, since they are determined by

$$mx^2 + ny^2 + 2pxy = 0. \tag{2}$$


The equation (2) cannot degenerate, i.e., m, n, p cannot be 0 simultaneously. In
fact, in that case C(0) would have as equation 2qx + 2ry + s = 0. If one of the
coefficients q, r is not zero, then C(0) is a straight line, which is false, since
the non-collinear points A, B, C lie on C(0). If q = r = 0 then the equation of C(0) is s = 0.
Therefore, if s is not 0, C(0) is empty, which is again false. If s = 0 then
C(0) = R², which easily implies that all three directions α, β, γ are the same; this
was excluded from the beginning.
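The claim that the oriented area of UVW is a polynomial of degree at most two in (x, y) that vanishes at A, B, C can be tested with exact rational arithmetic. This is a sketch under stated assumptions: the triangle, the three direction vectors, and the function names are hypothetical choices made for illustration, not data from the article. A function of degree at most two has vanishing third finite differences along every line, which is what the helper `third_difference` evaluates.

```python
from fractions import Fraction as F

def cross(u, v):
    return u[0] * v[1] - u[1] * v[0]

def project(p, q1, q2, d):
    """Point where the line through p with direction d meets the line q1q2."""
    e = (q2[0] - q1[0], q2[1] - q1[1])
    r = (q1[0] - p[0], q1[1] - p[1])
    t = cross(r, e) / cross(d, e)   # division by zero if d is parallel to q1q2
    return (p[0] + t * d[0], p[1] + t * d[1])

def area_uvw(p, tri, dirs):
    """Oriented area of UVW for the point p, triangle tri and directions dirs."""
    A, B, C = tri
    alpha, beta, gamma = dirs
    U = project(p, B, C, alpha)   # on side a = BC
    V = project(p, C, A, beta)    # on side b = CA
    W = project(p, A, B, gamma)   # on side c = AB
    return cross((V[0] - U[0], V[1] - U[1]), (W[0] - U[0], W[1] - U[1])) / 2

def third_difference(p0, v, tri, dirs):
    """Third finite difference of the area along the line p0 + t*v."""
    g = [area_uvw((p0[0] + t * v[0], p0[1] + t * v[1]), tri, dirs) for t in range(4)]
    return g[3] - 3 * g[2] + 3 * g[1] - g[0]

# Hypothetical sample data: a triangle and three pairwise different directions.
tri = ((F(0), F(0)), (F(4), F(0)), (F(1), F(3)))
dirs = ((F(1), F(2)), (F(1), F(-1)), (F(2), F(1)))

print([area_uvw(vertex, tri, dirs) for vertex in tri])        # zero at A, B, C
print(third_difference((F(1), F(2)), (F(2), F(-3)), tri, dirs))
```

Because the arithmetic is exact, the vanishing third differences are equalities rather than approximations.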

Therefore, C(k) is a family of conics with the same points at infinity, as 
announced in the statement of the theorem. If we take as the origin an arbitrary 
point O and as coordinate axes the two (always real) bisectors of the angles formed 
by the lines joining O with the points at infinity, then the equation (1) takes 
the form 


$$mx^2 + ny^2 + 2qx + 2ry + s = 2k \tag{3}$$


(where m,n, q,r,s, of course, need not be the same as before). 

If for some k the conic C(k) has a point Z as center and we take Z as the 
origin and as coordinate axes the axes of the conic (the two bisectors of the angle 
obtained by joining Z to the two points at infinity), then the equation (3) takes 
the form 


$$mx^2 + ny^2 + s = 2k \tag{4}$$


with m, n, s depending only, as before, on A, B, C, α, β, γ, which makes plain the
simple structure of the family C(k). All, except for s = 2k, are concentric and
homothetic with respect to Z. For s = 2k the conic C(k) degenerates and
becomes either a pair of real lines (which can coincide) or a pair of imaginary lines
that intersect at Z.

If in equation (4) it happens, for example, that m = 0 then the equation of
C(k) would be $ny^2 + s = 2k$, which is, for each k, a pair of parallel lines, real or
complex according to the value of k; this pair of lines becomes one double real
line if s = 2k. It is clear, in this case, that any point of such a double line is a
center of symmetry of each C(k).

If no C(k) has a center, then in the equation

$$mx^2 + ny^2 + 2qx + 2ry + s = 2k$$

we have either m = 0 or n = 0. Assume n = 0. Then

$$mx^2 + 2qx + 2ry + s = 2k$$

and if we make the change x = X − q/m (i.e., a translation of the y axis) we
obtain

$$mX^2 + 2ry + t = 2k$$

with m, r, t depending only, as before, on A, B, C, α, β, γ. The coefficient r
cannot be null, since if r = 0 we would have $mX^2 + t = 2k$, and this is, for each
k, a pair of parallel lines and thus has a center. Therefore, r is not null and so
C(k) is, in this case, a family of parabolas which are all obtained as translations of
one of them in the direction of its axis.


Construction, with compass and straightedge, of the center, axes and asymptotes
of C(k).


Here we indicate a practical construction for the main elements of the conics
C(k). According to the previous part of the theorem it is sufficient to find these
elements for C(0), of which we already know three points A, B, C.

If two of the projection directions coincide, for example β = γ, it is easy to
show in a direct way that C(0) is the line BC together with the line through A in
the direction β = γ, independent of the direction α. It is also easy to show that
coincidence of two of the projection directions is a necessary and sufficient
condition for degeneration of the conic C(0) into two lines (one of them is the line
containing one of the sides of the triangle and the other is the line through the
opposite vertex in the same direction as the two coinciding ones). It is then quite
clear that, if β = γ does not coincide with the direction of the side BC, then
C(k), with k different from 0, is a family of hyperbolas having these two lines as
asymptotes that are homothetic to each other with respect to their common center,
the intersection of those two lines. When β = γ has the same direction as the side
BC, each C(k) is a pair of parallel lines (that coincide for k = 1/2) and parallel to
those that constitute C(0). Thus, we have determined the main elements of C(k) if
C(0) degenerates.

If no pair of projection directions coincide, then C(0) is a non-degenerate conic
for which we already know three points. Another three are easily determined in
the following way:


Figure 3


Through B we draw a line parallel to γ and through C a line parallel to β. For the
intersection R of these two lines we get V = C and W = B, so UVW degenerates and R
belongs to C(0). Through C we draw a line parallel to α and through A a line
parallel to γ; similarly the intersection S belongs to C(0). Through A we draw a
line parallel to β and through B a line parallel to α; the intersection T belongs
to C(0).

We thus obtain a hexagon ATBRCS whose vertices are on C(0) and whose
opposite sides are parallel. It is clear that the midpoints of a pair of
opposite sides lie on a diameter of C(0). We thus obtain the center of C(0) (when
it does exist) as the intersection of these three diameters, DG, EH, FI (see Figure 3).
When the center does not exist we obtain at least the direction of the axis of the
parabola C(0).

Assume for the moment that there is a center as indicated in Figure 3. Since we
already have three pairs of conjugate diameters, the construction, with compass
and straightedge, of the common asymptotes and axes of all conics C(k) is a
well-known exercise.

For the determination of the vertex of the parabola C(0) when the center does
not exist, we proceed as follows. Through A we draw a line perpendicular to the
direction of the axis, which we already know. We determine its intersection A′
with the parabola; the perpendicular bisector of AA′ is the axis of C(0). The intersection of
this line with the parabola gives us the vertex. The parabolas C(k) are the
translations of C(0) in the direction of its axis.


3. SOME INTERESTING EXERCISES RELATED TO THE THEOREM. The 
theorem just proved suggests many interesting exercises, which can be solved easily 
with the methods we have used in the proof. Here are a few with some indications 
of the way to solve them. Fix a triangle ABC. 


1. Find necessary and sufficient conditions on the projection directions in order that
C(0) and C(k) are circles. (Of course, C(0) will be the circumcircle and all C(k) will
be concentric to it.)

Hint: Consider the inscribed hexagon in C(0) that we used in the construction of the
main elements of C(0). The orthogonal projections to the sides (the Wallace Theorem)
are not the only possibilities in order to get circles.


2. Determine necessary and sufficient conditions on the projection directions in 
order that the conics C(k) are equilateral hyperbolas. 

Hint: Remember that a necessary and sufficient condition for a conic that goes
through three points A, B, C to be an equilateral hyperbola is that it also goes through
the orthocenter of ABC.


3. For two different projection directions, determine a third projection so that the 
C(k) are equilateral hyperbolas. 
Hint: See the hint for Exercise 2. 


4. Given three directions α, β, γ, find, if possible, a point F such that the triangle
MNP with vertices at the projections of F on the sides a, b, c in the directions α, β, γ
has maximum or minimum area.

Hint: Consider the equation of the conic C(k) with respect to its axes. 


5. Given a non-degenerate conic C and the triangle ABC inscribed in it, determine
three directions α, β, γ such that C is the conic C(0) for ABC and the three given
directions.

Hint: Remember the hexagon inscribed in the conic that we used in the construction
of the elements of C(0).


6. Assume that triangle ABC has area S and that the radius of its circumscribed 
circle G is R. We draw a circle K concentric with G and with radius r. From a point P 
of K we draw its projections U,V, W on the sides of ABC. Determine, as a function of 
S, R, and r, the area of the triangle UVW. 

Hint: The same as in Exercise 4. Answer: Area(UVW) = $(S/4)(1 - r^2/R^2)$,
having selected the appropriate orientation so that the triangle UVW has positive area
when r < R.
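The answer to Exercise 6 can be compared with a direct computation. The sketch below is illustrative only: the circumradius, the vertex angles, and the function names are assumptions, and the triangle is taken in counterclockwise order so that S is positive.

```python
import math

def foot(p, a, b):
    """Orthogonal projection of the point p onto the line through a and b."""
    dx, dy = b[0] - a[0], b[1] - a[1]
    t = ((p[0] - a[0]) * dx + (p[1] - a[1]) * dy) / (dx * dx + dy * dy)
    return (a[0] + t * dx, a[1] + t * dy)

def signed_area(u, v, w):
    return 0.5 * ((v[0] - u[0]) * (w[1] - u[1]) - (w[0] - u[0]) * (v[1] - u[1]))

R = 2.0  # circumradius; the circumcircle G is centered at the origin
A, B, C = [(R * math.cos(t), R * math.sin(t)) for t in (0.3, 2.1, 4.4)]
S = signed_area(A, B, C)  # positive because A, B, C are in counterclockwise order

def pedal_area_at(r, theta):
    """Oriented area of UVW for the point P at distance r from the center."""
    P = (r * math.cos(theta), r * math.sin(theta))
    return signed_area(foot(P, B, C), foot(P, C, A), foot(P, A, B))

def predicted(r):
    """The stated answer (S/4)(1 - r^2/R^2)."""
    return S / 4 * (1 - r * r / (R * R))

for r in (0.0, 0.5, 1.3, 2.0, 3.1):
    print(r, pedal_area_at(r, 1.7), predicted(r))
```

The two printed columns agree for every sampled radius, including the sign change when r exceeds R.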


Another Short Proof of Ramanujan’s
Mod 5 Partition Congruence, and More 


Michael D. Hirschhorn 


We present another novel short proof of Ramanujan’s partition congruence

$$p(5n + 4) \equiv 0 \pmod 5 \tag{1}$$

in addition to that presented by John L. Drost [2], and indeed prove rather more.


Ramanujan made the remarkable observation from a table of values of p(n), 
the number of partitions of n, that p(5n + 4) is divisible by 5. He observed and
conjectured much more, and his conjectures turned out in the main to be correct. 
He gave a simple proof, based upon identities of Euler and Jacobi, of the 
conjecture (1), and his proof is essentially the one reproduced in Hardy and Wright 
[3] and referred to by Drost. Ramanujan’s proof relies on manipulating power 
series, and considering coefficients modulo 5. It is my intention to give a proof of a 
similar sort, more transparent than that of Ramanujan, using only the identity of
Jacobi. And further, with a little extra work including the use of Jacobi’s triple- 
product identity, we prove remarkable congruences for the partition function due 
to Atkin and Swinnerton-Dyer. 


As is usual, write $(q)_\infty = \prod_{n \ge 1} (1 - q^n)$. Then

$$\sum_{n \ge 0} p(n)\, q^n = \frac{1}{(q)_\infty}.$$

We begin with Jacobi’s identity [3, Theorem 357],

$$(q)_\infty^3 = \sum_{n \ge 0} (-1)^n (2n + 1)\, q^{(n^2+n)/2}.$$

Each coefficient is congruent modulo 5 to 0, ±1 or ±2. Specifically, the coefficient
is congruent to 1 when n ≡ 0 or 9 (mod 10), −1 when n ≡ 4 or 5 (mod 10),
+2 when n ≡ 1 or 8 (mod 10), −2 when n ≡ 3 or 6 (mod 10), and 0 when n ≡ 2
or 7 (mod 10). Thus we find that, modulo 5,


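$$\begin{aligned}
(q)_\infty^3 &\equiv \sum_{n\ge0} q^{10n(10n+1)/2} - \sum_{n\ge0} q^{(10n+4)(10n+5)/2} - \sum_{n\ge0} q^{(10n+5)(10n+6)/2} + \sum_{n\ge0} q^{(10n+9)(10n+10)/2} \\
&\quad + 2\sum_{n\ge0} q^{(10n+1)(10n+2)/2} - 2\sum_{n\ge0} q^{(10n+3)(10n+4)/2} - 2\sum_{n\ge0} q^{(10n+6)(10n+7)/2} + 2\sum_{n\ge0} q^{(10n+8)(10n+9)/2} \\
&= \sum_{n\ge0} q^{50n^2+5n} - \sum_{n\ge0} q^{50n^2+45n+10} - \sum_{n\ge0} q^{50n^2+55n+15} + \sum_{n\ge0} q^{50n^2+95n+45} \\
&\quad + 2\sum_{n\ge0} q^{50n^2+15n+1} - 2\sum_{n\ge0} q^{50n^2+35n+6} - 2\sum_{n\ge0} q^{50n^2+65n+21} + 2\sum_{n\ge0} q^{50n^2+85n+36}.
\end{aligned}$$
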


Observe that in the first four sums the powers of q are congruent to 0 (mod 5)
while in the latter four sums the powers of q are congruent to 1 (mod 5). Thus we
have

$$(q)_\infty^3 \equiv X + 2qY,$$

where each of X, Y is a series in powers of $q^5$.
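The residue pattern behind the splitting into X and 2qY is easy to tabulate. The lines below are a small illustrative sketch (the helper name `centered_mod5` is an assumption): they list the coefficient of Jacobi's series modulo 5, taken in the range −2..2, together with the exponent n(n+1)/2 modulo 5, for one full period n = 0, ..., 9.

```python
def centered_mod5(value):
    """Representative of value modulo 5 chosen from {-2, -1, 0, 1, 2}."""
    r = value % 5
    return r - 5 if r > 2 else r

coefficients = [centered_mod5((-1) ** n * (2 * n + 1)) for n in range(10)]
exponents_mod5 = [(n * (n + 1) // 2) % 5 for n in range(10)]

print(coefficients)    # [1, 2, 0, -2, -1, -1, -2, 0, 2, 1]
print(exponents_mod5)  # [0, 1, 3, 1, 0, 0, 1, 3, 1, 0]
```

Terms with coefficient ±1 have exponents divisible by 5, terms with coefficient ±2 have exponents congruent to 1, and the terms whose exponent is congruent to 3 carry a coefficient divisible by 5.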


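Also

$$(q)_\infty^5 = \prod_{n\ge1} (1 - q^n)^5 = \prod_{n\ge1} \left(1 - 5q^n + 10q^{2n} - 10q^{3n} + 5q^{4n} - q^{5n}\right) \equiv \prod_{n\ge1} (1 - q^{5n}) = (q^5)_\infty.$$

Thus

$$\begin{aligned}
\sum_{n\ge0} p(n)\, q^n &= \frac{1}{(q)_\infty} = \frac{(q)_\infty^9}{(q)_\infty^{10}} = \frac{\left((q)_\infty^3\right)^3}{\left((q)_\infty^5\right)^2} \equiv \frac{(X + 2qY)^3}{(q^5)_\infty^2} \\
&= \frac{X^3 + 6qX^2Y + 12q^2XY^2 + 8q^3Y^3}{(q^5)_\infty^2} \\
&\equiv \frac{X^3 + qX^2Y + 2q^2XY^2 + 3q^3Y^3}{(q^5)_\infty^2}.
\end{aligned}$$
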
Comparing terms containing powers of q congruent to 4 modulo 5 on both sides,
we see that

$$\sum_{n\ge0} p(5n + 4)\, q^{5n+4} \equiv 0 \pmod 5. \qquad \blacksquare$$

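As a sanity check on the congruence, the partition numbers can be tabulated directly. This is a minimal sketch using the standard dynamic-programming recurrence over part sizes, which is a different technique from the series argument above; the function name and the cutoff 200 are arbitrary choices.

```python
def partition_numbers(limit):
    """Return [p(0), ..., p(limit)] by adding one allowed part size at a time."""
    p = [1] + [0] * limit
    for part in range(1, limit + 1):
        for total in range(part, limit + 1):
            p[total] += p[total - part]
    return p

p = partition_numbers(200)
residues = [p[5 * n + 4] % 5 for n in range(40)]

print(p[4], p[9], p[14])   # 5 30 135
print(set(residues))       # {0}
```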


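Notice that, at no extra cost, we obtain the congruences

$$\sum_{n\ge0} p(5n)\, q^{5n} \equiv \frac{X^3}{(q^5)_\infty^2},$$

$$\sum_{n\ge0} p(5n + 1)\, q^{5n+1} \equiv \frac{qX^2Y}{(q^5)_\infty^2},$$

$$\sum_{n\ge0} p(5n + 2)\, q^{5n+2} \equiv \frac{2q^2XY^2}{(q^5)_\infty^2}, \quad\text{and}$$

$$\sum_{n\ge0} p(5n + 3)\, q^{5n+3} \equiv \frac{3q^3Y^3}{(q^5)_\infty^2}.$$
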

It is not hard to show that each of X, Y is an infinite product. Indeed, as we
shall see,

$$X = \prod_{n\ge1} (1 - q^{25n-15})(1 - q^{25n-10})(1 - q^{25n}), \tag{2}$$

$$Y = \prod_{n\ge1} (1 - q^{25n-20})(1 - q^{25n-5})(1 - q^{25n}). \tag{3}$$


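It follows that

$$\sum_{n\ge0} p(5n)\, q^n \equiv \prod_{n\ge1} \frac{(1 - q^{5n-3})(1 - q^{5n-2})(1 - q^{5n})}{(1 - q^{5n-4})^2 (1 - q^{5n-1})^2},$$

$$\sum_{n\ge0} p(5n + 1)\, q^n \equiv \prod_{n\ge1} \frac{(1 - q^{5n})}{(1 - q^{5n-4})(1 - q^{5n-1})},$$

$$\sum_{n\ge0} p(5n + 2)\, q^n \equiv 2 \prod_{n\ge1} \frac{(1 - q^{5n})}{(1 - q^{5n-3})(1 - q^{5n-2})},$$

$$\sum_{n\ge0} p(5n + 3)\, q^n \equiv 3 \prod_{n\ge1} \frac{(1 - q^{5n-4})(1 - q^{5n-1})(1 - q^{5n})}{(1 - q^{5n-3})^2 (1 - q^{5n-2})^2}.$$


These remarkable results are due to Atkin and Swinnerton-Dyer [1, Theorem 1].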


We now show that X, Y are the infinite products claimed in (2) and (3).
We have

$$X = \sum_{n\ge0} q^{50n^2+5n} - \sum_{n\ge0} q^{50n^2+45n+10} - \sum_{n\ge0} q^{50n^2+55n+15} + \sum_{n\ge0} q^{50n^2+95n+45}.$$

In the first sum, replace n by −n, in the third replace n by −n − 1, and in the
fourth replace n by n − 1. Then we find

$$\begin{aligned}
X &= \sum_{n\le0} q^{50n^2-5n} - \sum_{n\ge0} q^{50n^2+45n+10} - \sum_{n\le-1} q^{50n^2+45n+10} + \sum_{n\ge1} q^{50n^2-5n} \\
&= \sum_{n=-\infty}^{\infty} q^{50n^2-5n} - \sum_{n=-\infty}^{\infty} q^{50n^2+45n+10} \\
&= \sum_{n=-\infty}^{\infty} (-1)^n q^{(25n^2-5n)/2}.
\end{aligned}$$

The terms for n even in the final sum correspond to the first sum on the line
above; the terms for n odd to the second sum.
In the same way, we find

$$\begin{aligned}
Y &= \sum_{n\ge0} q^{50n^2+15n} - \sum_{n\ge0} q^{50n^2+35n+5} - \sum_{n\ge0} q^{50n^2+65n+20} + \sum_{n\ge0} q^{50n^2+85n+35} \\
&= \sum_{n\le0} q^{50n^2-15n} - \sum_{n\ge0} q^{50n^2+35n+5} - \sum_{n\le-1} q^{50n^2+35n+5} + \sum_{n\ge1} q^{50n^2-15n} \\
&= \sum_{n=-\infty}^{\infty} q^{50n^2-15n} - \sum_{n=-\infty}^{\infty} q^{50n^2+35n+5} \\
&= \sum_{n=-\infty}^{\infty} (-1)^n q^{(25n^2-15n)/2}.
\end{aligned}$$


To complete the proof, we now invoke Jacobi’s triple product identity [3, Theorem
352], in the form

$$\sum_{n=-\infty}^{\infty} (-1)^n a^n q^{(n^2-n)/2} = \prod_{n\ge1} (1 - aq^{n-1})(1 - a^{-1}q^n)(1 - q^n)$$

with q replaced by $q^{25}$ and a replaced by $q^{10}$ and $q^5$, respectively.
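The two product formulas can be compared with the two-sided sums coefficient by coefficient. The sketch below assumes the readings X = Σ (−1)^n q^((25n²−5n)/2) and Y = Σ (−1)^n q^((25n²−15n)/2), with products over the exponents 25n−15, 25n−10, 25n and 25n−20, 25n−5, 25n respectively; the function names and the truncation order N are illustrative choices.

```python
N = 300  # compare the coefficients of q^0, ..., q^(N-1)

def theta_series(shift):
    """Coefficients of the sum over all integers n of (-1)^n q^((25n^2 - shift*n)/2)."""
    coeffs = [0] * N
    for n in range(-10, 11):            # larger |n| only gives exponents >= N
        exponent = (25 * n * n - shift * n) // 2
        if exponent < N:
            coeffs[exponent] += 1 if n % 2 == 0 else -1
    return coeffs

def triple_product(e1, e2):
    """Coefficients of prod_{n>=1} (1 - q^(25n-e1)) (1 - q^(25n-e2)) (1 - q^(25n))."""
    coeffs = [1] + [0] * (N - 1)
    n = 1
    while 25 * n - max(e1, e2) < N:
        for e in (25 * n - e1, 25 * n - e2, 25 * n):
            # Multiply the truncated series by (1 - q^e) in place.
            for i in range(N - 1, e - 1, -1):
                coeffs[i] -= coeffs[i - e]
        n += 1
    return coeffs

print(theta_series(5) == triple_product(15, 10))   # X: sum versus product
print(theta_series(15) == triple_product(20, 5))   # Y: sum versus product
```

Agreement up to a finite order is only evidence, of course; the identity itself is what the triple product supplies.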


UNSOLVED PROBLEMS
Edited by Richard Nowakowski 


In this department the MONTHLY presents easily stated unsolved problems dealing with
notions ordinarily encountered in undergraduate mathematics. Each problem should be
accompanied by relevant references (if any are known to the author) and by a brief
description of known partial or related results. Typescripts should be sent to Richard
Nowakowski, Department of Mathematics & Statistics & Computing Science, Dalhousie
University, Halifax NS, Canada B3H 3J5, rin@mscs.dal.ca


The Cayley Addition Table of $Z_n$


Hunter S. Snevily 


Few mathematical objects could be considered more simple than the Cayley
addition table of $Z_n$, but we show that even these simple objects have some
interesting yet unproved properties.

A transversal of an n × n matrix is a collection of n cells, no two of which are in
the same row or column. A transversal of a matrix is a latin transversal if no two of
its cells contain the same element.


Conjecture 1. For any odd n and any k ∈ {1, ..., n}, any k × k submatrix of the
Cayley addition table of $Z_n$ contains a latin transversal.


Conjecture 2. For any even n and any k ∈ {1, ..., n}, any k × k submatrix of
the Cayley addition table of $Z_n$ contains a latin transversal provided the submatrix
is not a subgroup of even order or a translate of such a subgroup.
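For small n the two conjectures can be explored by exhaustive search. The following sketch is only an illustration (the function names are assumptions, and the brute force over all permutations is feasible only for small n); it checks every square submatrix of the addition table for n = 5 and n = 7, and exhibits the subgroup {0, 2} of Z_4 as a submatrix with no latin transversal.

```python
from itertools import combinations, permutations

def has_latin_transversal(rows, cols, n):
    """True if the submatrix of the Z_n addition table on rows x cols has one."""
    k = len(rows)
    for perm in permutations(cols):
        # Pair row i with column perm[i]; the cell holds (row + column) mod n.
        if len({(r + c) % n for r, c in zip(rows, perm)}) == k:
            return True
    return False

def all_submatrices_ok(n):
    """Check every k x k submatrix, 1 <= k <= n, of the Z_n addition table."""
    for k in range(1, n + 1):
        for rows in combinations(range(n), k):
            for cols in combinations(range(n), k):
                if not has_latin_transversal(rows, cols, n):
                    return False
    return True

print(all_submatrices_ok(5), all_submatrices_ok(7))
print(has_latin_transversal((0, 2), (0, 2), 4))   # the subgroup {0, 2} of Z_4
```

A finite search of this kind says nothing about general n; it only confirms that the statements are consistent with the smallest cases.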


Perhaps a stronger version of Conjecture 1 might be true. 


Conjecture 3. Let A be the Cayley table of any Abelian group of odd order n, and 
let k € {1,...,n}. Then any k X k submatrix of A contains a latin transversal. 


Conjecture 3 implies a fundamental property of finite fields of odd characteristic:
Let GF[q] be a finite field of odd characteristic containing q elements and let
$A = \{a_1, a_2, \ldots, a_k\}$ and $B = \{b_1, b_2, \ldots, b_k\}$, k < q, be two subsets of GF[q].
Then there is a permutation $\pi \in S_k$ such that the sums $a_i + b_{\pi(i)}$ (in GF[q]) are
pairwise distinct.

So far the only result that supports these conjectures (and the next) is the 
following theorem of Noga Alon. The special case k = n for Conjectures 1, 2, and 
3 was proved in [2]. 


Theorem 1 (Alon). Let p be a prime, suppose k < p, let $(a_1, a_2, \ldots, a_k)$ be a
sequence of not necessarily distinct members of the finite field $Z_p$, and let B be a subset
of cardinality k of $Z_p$. Then there is a numbering $(b_1, b_2, \ldots, b_k)$ of the elements of B
such that the sums $a_i + b_i$ (in $Z_p$) are pairwise distinct.


Consider the infinitely long periodic sequence of integers modulo n. For 
example, modulo 6 the sequence is: 012345012345012345.... Now take k copies of 
this sequence (k < n) and start them at any position (i.e., truncate them anywhere). 

For example, taking the sequence modulo 6 and k = 3, 

012345012345012345...; 

012345012345012345...; 

012345012345012345... 
all start at the same position and 


012345012345012345....; 
234501234501234501...; 
501234501234501234.... 
all start at different positions. A k × k frame (matrix) consists of k consecutive
columns taken from such an arrangement. For the second example all frames look 
like: 
abc 
cef 
gab 
This is the template of our k sequences.


Conjecture 4. Take k copies of the infinitely long periodic sequence modulo n
(k < n) and start them anywhere (i.e., truncate them anywhere). Then the template
has a latin transversal.


Note that Alon has proved this conjecture when n is a prime. The following
examples show why k cannot equal n:


n = k = 4          n = k = 5
  0123               01234
  0123      and      01234
  0123               01234
  1230               01234
                     12340
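The two examples can be confirmed by brute force. In this sketch, which is an illustration rather than anything taken from the source, a frame is described only by the first entry of each row (the hypothetical parameter `offsets`), since the entry in row i and column c is then (offsets[i] + c) mod n.

```python
from itertools import permutations

def frame_has_latin_transversal(offsets, n):
    """offsets[i] is the first entry of row i of a k x k frame of sequences mod n."""
    k = len(offsets)
    # A transversal picks column perm[i] in row i; it is latin when the
    # k selected entries (offsets[i] + perm[i]) mod n are pairwise distinct.
    return any(
        len({(s + c) % n for s, c in zip(offsets, perm)}) == k
        for perm in permutations(range(k))
    )

print(frame_has_latin_transversal((0, 0, 0, 1), 4))      # the n = k = 4 example
print(frame_has_latin_transversal((0, 0, 0, 0, 1), 5))   # the n = k = 5 example
print(frame_has_latin_transversal((0, 2, 5), 6))         # a case with k = 3 < n = 6
```

For the two examples with k = n a counting argument gives the same answer: the selected entries would have to be all residues, but their sum differs from 0 + 1 + ... + (n − 1) by 1 modulo n.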


PROBLEMS AND SOLUTIONS


Edited by Gerald A. Edgar, Daniel H. Ullman, and Douglas B. West 


with the collaboration of Paul T. Bateman, Mario Benedicty, Paul Bracken, Duane M. Broline, Ezra 
A. Brown, Richard T. Bumby, Glenn G. Chappell, Randall Dougherty, Roger B. Eggleton, Ira M. 
Gessel, Bart Goddard, Jerrold R. Griggs, Douglas A. Hensley, Richard Holzsager, John R. Isbell, 
Robert Israel, Kiran S. Kedlaya, Murray S. Klamkin, Fred Kochman, Frederick W. Luttmann, Vania 
Mascioni, Frank B. Miles, Richard Pfiefer, Cecil C. Rousseau, Leonard Smiley, John Henry Steelman, 
Kenneth Stolarsky, Richard Stong, Charles Vanden Eynden, and William E. Watkins. 


Proposed problems and solutions should be sent in duplicate to the MONTHLY
problems address on the inside front cover. Submitted problems should include
solutions and relevant references. Submitted solutions should arrive at that address
before November 30, 1999. Additional information, such as generalizations and
references, is welcome. The problem number and the solver’s name and address
should appear on each solution. An acknowledgement will be sent only if a mailing
label is provided. An asterisk (*) after the number of a problem or a part of a
problem indicates that no solution is currently available.


PROBLEMS 


10739. Proposed by Oscar Ciaurri, Logroño, Spain. Suppose that f: [0, 1] → R has
a continuous second derivative with f''(x) > 0 on (0, 1), and suppose that f(0) = 0.
Choose a ∈ (0, 1) such that f'(a) < f(1). Show that there is a unique b ∈ (a, 1) such that


f'(a) = f(b)/b.


10740. Proposed by Charles Vanden Eynden, Illinois State University, Normal, IL. A
connected bipartite simple graph has a unique bipartition, meaning a partition of the vertices
into two independent sets. Let $\mathcal{G}$ be the set of such graphs that have no isomorphism that
interchanges the two sets of the bipartition. Is there a criterion that for each $G \in \mathcal{G}$ selects
a well-defined set of the bipartition?


10741. Proposed by Tim Keller, Fair Oaks, CA. Is there an even base b for which there exist
square integers of the form $dddd_b$? By $dddd_b$ we mean the four-digit number in base b all
of whose digits are d. For odd b we have the examples $1111_7 = 20^2$ and $4444_7 = 40^2$.
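The two base-7 examples are quick to confirm. The helper names below are hypothetical; the sketch simply evaluates the four-digit numeral dddd in base b as d(b³ + b² + b + 1) and tests it for being a perfect square. It makes no claim about even bases, which is the actual question posed.

```python
from math import isqrt

def repdigit4(d, b):
    """Value of the four-digit base-b numeral all of whose digits are d."""
    return d * (b ** 3 + b ** 2 + b + 1)

def square_repdigits(b):
    """Pairs (d, root) with 1 <= d < b such that dddd in base b equals root^2."""
    found = []
    for d in range(1, b):
        value = repdigit4(d, b)
        root = isqrt(value)
        if root * root == value:
            found.append((d, root))
    return found

print(square_repdigits(7))   # [(1, 20), (4, 40)]
```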


10742. Proposed by Emre Alkan, University of Wisconsin, Madison, WI. Let us say that 
a finite group G has the maximal property if, for any prime p that divides |G|, G has a 
maximal subgroup H such that p| | divides |G|. 

(a) Show that every finite solvable group has the maximal property. 

(b) Show that there are infinitely many finite groups with the maximal property that are not 
solvable. 

(c) Show that there are infinitely many finite groups without the maximal property that are 
not solvable. 


10743. Proposed by Calin Popescu, Université Catholique de Louvain, Louvain-La-Neuve,
Belgium. Let p > 5 be prime, and let n be an integer such that (p + 1)/2 < n < p − 2.
Let $R = \sum (-1)^i \binom{n}{i}$, where the sum is taken over the quadratic residues i modulo p, and
let $N = \sum (-1)^j \binom{n}{j}$, where the sum is taken over the quadratic nonresidues j modulo p.
Prove that exactly one of R and N is divisible by p.


10744. Proposed by Peter Lindqvist, Norwegian University of Science and Technology,
Trondheim, Norway, and Jaak Peetre, University of Lund, Lund, Sweden. Fix p > 0, and
define functions S(x), C(x), and T(x) for sufficiently small x by

$$x = \int_0^{S(x)} \frac{dt}{(1 - t^p)^{(p-1)/p}}, \qquad x = \int_{C(x)}^{1} \frac{dt}{(1 - t^p)^{(p-1)/p}}, \qquad x = \int_0^{T(x)} \frac{dt}{(1 + t^p)^{2/p}}.$$

Show that $S(x)^p + C(x)^p = 1$ and that T(x) = S(x)/C(x). The case p = 2 yields the
familiar trigonometric formulas.


10745. Proposed by M. J. Pelling, London, England. For n > 1, let f(n) be the number of
solutions (r, s, t) in positive integers to the Diophantine equation rst = n(r + s + t).

(a) Prove that $f(n) = O(n^{1/2+\delta})$ for every δ > 0.

(b)* Prove that $f(n) = O(n^{\delta})$ for every δ > 0.


SOLUTIONS 


Using the Walls to Find the Center 


10386 [1994, 474]. Proposed by Jordan Tabov, Bulgarian Academy of Sciences, Sofia,
Bulgaria. Let a tetrahedron with vertices $A_1, A_2, A_3, A_4$ have altitudes that meet in a point
H. For any point P, let $P_1, P_2, P_3$, and $P_4$ be the feet of the perpendiculars from P to
the faces $A_2A_3A_4$, $A_3A_4A_1$, $A_4A_1A_2$, and $A_1A_2A_3$, respectively. Prove that there exist
constants $a_1, a_2, a_3$, and $a_4$ such that one has

$$a_1\overrightarrow{PP_1} + a_2\overrightarrow{PP_2} + a_3\overrightarrow{PP_3} + a_4\overrightarrow{PP_4} = \overrightarrow{PH}$$

for every point P.

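Solution by Murray S. Klamkin, University of Alberta, Edmonton, Alberta, Canada. More
generally, let H and P be any two points in the space of the given tetrahedron and let $P_1$,
$P_2$, $P_3$, $P_4$ be the feet of the lines through P parallel to $HA_1$, $HA_2$, $HA_3$, $HA_4$ in the faces
of the tetrahedron opposite $A_1, A_2, A_3, A_4$, respectively. Then there exist constants $a_1, a_2$,
$a_3, a_4$, independent of P, such that

$$a_1\overrightarrow{PP_1} + a_2\overrightarrow{PP_2} + a_3\overrightarrow{PP_3} + a_4\overrightarrow{PP_4} = \overrightarrow{PH}.$$

Let $\mathbf{V}$ denote the vector from an origin outside the space of the given tetrahedron to any
point V in the space of the tetrahedron. Then H and P have the representations (barycentric
coordinates)

$$\mathbf{H} = x_1\mathbf{A}_1 + x_2\mathbf{A}_2 + x_3\mathbf{A}_3 + x_4\mathbf{A}_4 \quad (x_1 + x_2 + x_3 + x_4 = 1),$$

$$\mathbf{P} = u_1\mathbf{A}_1 + u_2\mathbf{A}_2 + u_3\mathbf{A}_3 + u_4\mathbf{A}_4 \quad (u_1 + u_2 + u_3 + u_4 = 1).$$

Since $P_1$ has the representation $\mathbf{P}_1 = r_2\mathbf{A}_2 + r_3\mathbf{A}_3 + r_4\mathbf{A}_4$, where $r_2 + r_3 + r_4 = 1$, we
must have

$$r_2\mathbf{A}_2 + r_3\mathbf{A}_3 + r_4\mathbf{A}_4 - \mathbf{P} = \lambda_1(\mathbf{H} - \mathbf{A}_1).$$

Since $\mathbf{A}_1, \mathbf{A}_2, \mathbf{A}_3, \mathbf{A}_4$ are independent vectors, we get $\lambda_1 = u_1/(1 - x_1)$, so that
$\overrightarrow{PP_1} = (\mathbf{P}_1 - \mathbf{P}) = (\mathbf{H} - \mathbf{A}_1)\,u_1/(1 - x_1)$. Similarly,

$$(\mathbf{P}_i - \mathbf{P}) = (\mathbf{H} - \mathbf{A}_i)\,\frac{u_i}{1 - x_i} \quad \text{for } i = 1, 2, 3, 4.$$

Choosing $a_i = 1 - x_i$, we obtain

$$\sum a_i(\mathbf{P}_i - \mathbf{P}) = \sum u_i(\mathbf{H} - \mathbf{A}_i) = \mathbf{H} - \mathbf{P} = \overrightarrow{PH}.$$

This proof generalizes to give an analogous result for n-dimensional simplices.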
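The vector identity with the constants 1 − x_i can be verified exactly on a concrete tetrahedron. Everything specific in this sketch is an assumption made for illustration: the vertex coordinates, the barycentric weights of H and P, and the helper names. The feet P_i are computed independently, as intersections of a line with the plane of a face, rather than from the formula derived in the solution.

```python
from fractions import Fraction as F

def sub(u, v): return tuple(a - b for a, b in zip(u, v))
def add(u, v): return tuple(a + b for a, b in zip(u, v))
def scale(c, u): return tuple(c * a for a in u)
def dot(u, v): return sum(a * b for a, b in zip(u, v))

def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1], u[2] * v[0] - u[0] * v[2], u[0] * v[1] - u[1] * v[0])

def combination(weights, points):
    """Barycentric combination of the given points."""
    total = (F(0), F(0), F(0))
    for w, p in zip(weights, points):
        total = add(total, scale(w, p))
    return total

def foot_along(P, d, face):
    """Point where the line through P with direction d meets the plane of face."""
    f0, f1, f2 = face
    normal = cross(sub(f1, f0), sub(f2, f0))
    t = dot(normal, sub(f0, P)) / dot(normal, d)
    return add(P, scale(t, d))

A = [(0, 0, 0), (4, 0, 0), (1, 3, 0), (1, 1, 5)]          # sample tetrahedron
x = [F(1, 2), F(1, 6), F(1, 6), F(1, 6)]                  # barycentric weights of H
u = [F(1, 10), F(2, 10), F(3, 10), F(4, 10)]              # barycentric weights of P
H, P = combination(x, A), combination(u, A)

lhs = (F(0), F(0), F(0))
for i in range(4):
    face = [A[j] for j in range(4) if j != i]              # face opposite A_i
    P_i = foot_along(P, sub(H, A[i]), face)
    lhs = add(lhs, scale(1 - x[i], sub(P_i, P)))           # a_i = 1 - x_i

print(lhs == sub(H, P))   # True
```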


Solved also by J. Anglesio (France), R. J. Chapman (U. K.), M. Golomb, K. Hanes, N. Komanda, O. P. Lossers (The Netherlands), 
and the proposer. 


A Reciprocal Summation Identity

10490 [1995, 930]. Proposed by Seung-Jin Bang, Ajou University, Suwon, Korea. Show
that

$$\sum_{k=1}^{n} (-1)^{k-1} \binom{n}{k} \frac{1}{k} \sum_{j=1}^{k} \frac{1}{j} \left(1 + \frac{1}{2} + \cdots + \frac{1}{j}\right) = \sum_{k=1}^{n} \frac{1}{k^3}$$

for each positive integer n.
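The identity can be tested in exact arithmetic for small n. The sketch assumes the reading of the statement in which the left side is the alternating sum over k of C(n, k)/k times the sum over j ≤ k of (1 + 1/2 + ... + 1/j)/j, and the right side is the sum of 1/k³; the function names `lhs` and `rhs` are illustrative.

```python
from fractions import Fraction as F
from math import comb

def lhs(n):
    """Alternating binomial sum on the left side of the identity."""
    total = F(0)
    for k in range(1, n + 1):
        inner = sum(
            F(1, j) * sum(F(1, i) for i in range(1, j + 1))
            for j in range(1, k + 1)
        )
        total += (-1) ** (k - 1) * comb(n, k) * F(1, k) * inner
    return total

def rhs(n):
    """Sum of reciprocal cubes on the right side."""
    return sum(F(1, k ** 3) for k in range(1, n + 1))

for n in range(1, 6):
    print(n, lhs(n), rhs(n))
```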


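Solution I by Jeremy E. Dawson, Australian National University, Canberra, Australia. Since

$$\int_0^1 \frac{1}{z} \int_0^z \frac{1}{y} \int_0^y x^{k-1}\, dx\, dy\, dz = \frac{1}{k^3},$$

we have $\sum_{k=1}^{n} 1/k^3 = \int_0^1 \frac{1}{z} \int_0^z \frac{1}{y} \int_0^y (1 - x^n)/(1 - x)\, dx\, dy\, dz$. Letting w = 1 − x,
v = 1 − y, and u = 1 − z, we can rewrite the latter integral as

$$\int_0^1 \frac{1}{1-u} \int_u^1 \frac{1}{1-v} \int_v^1 \frac{1 - (1-w)^n}{w}\, dw\, dv\, du.$$

Now use $(1 - (1-w)^n)/w = \sum_{k=1}^{n} (-1)^{k-1} \binom{n}{k} w^{k-1}$ and interchange the order of summation
and integration. For the resulting multiple integrals, we use

$$\frac{1}{1-s} \int_s^1 t^{m-1}\, dt = \frac{1 - s^m}{m(1-s)} = \frac{1}{m} \sum_{l=1}^{m} s^{l-1}$$

twice to obtain

$$\int_0^1 \frac{1}{1-u} \int_u^1 \frac{1}{1-v} \int_v^1 w^{k-1}\, dw\, dv\, du = \frac{1}{k} \sum_{j=1}^{k} \frac{1}{j} \int_0^1 \sum_{i=1}^{j} u^{i-1}\, du = \frac{1}{k} \sum_{j=1}^{k} \frac{1}{j} \sum_{i=1}^{j} \frac{1}{i}.$$

Thus

$$\sum_{k=1}^{n} \frac{1}{k^3} = \sum_{k=1}^{n} (-1)^{k-1} \binom{n}{k} \frac{1}{k} \sum_{j=1}^{k} \frac{1}{j} \sum_{i=1}^{j} \frac{1}{i}.$$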


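Solution II by A. N. ’t Woord, University of Technology, Eindhoven, The Netherlands. We
prove first that if $b_n = \sum_{k=1}^{n} a_k \binom{n}{k}$ for n ≥ 1, then

$$\text{(1)}\quad \sum_{k=1}^{n} \frac{b_k}{k} = \sum_{k=1}^{n} \frac{a_k}{k} \binom{n}{k} \qquad\text{and}\qquad \text{(2)}\quad a_n = \sum_{k=1}^{n} (-1)^{n-k} b_k \binom{n}{k} \quad\text{for } n \ge 1.$$

(1) follows by induction on n: it is immediate for n = 1; and for n > 1 we have

$$\sum_{k=1}^{n} \frac{a_k}{k} \binom{n}{k} = \sum_{k=1}^{n-1} \frac{a_k}{k} \binom{n-1}{k} + \sum_{k=1}^{n} \frac{a_k}{k} \binom{n-1}{k-1} = \sum_{k=1}^{n-1} \frac{b_k}{k} + \frac{1}{n} \sum_{k=1}^{n} a_k \binom{n}{k} = \sum_{k=1}^{n} \frac{b_k}{k}.$$

For (2), we use $\binom{n}{k}\binom{k}{j} = \binom{n}{j}\binom{n-j}{k-j}$ and $\sum_{k=j}^{n} (-1)^{n-k} \binom{n-j}{k-j} = \delta_{jn}$ to obtain

$$\sum_{k=1}^{n} (-1)^{n-k} \binom{n}{k} b_k = \sum_{k=1}^{n} (-1)^{n-k} \binom{n}{k} \sum_{j=1}^{k} \binom{k}{j} a_j = \sum_{j=1}^{n} \binom{n}{j} a_j \sum_{k=j}^{n} (-1)^{n-k} \binom{n-j}{k-j} = \sum_{j=1}^{n} \binom{n}{j} a_j \delta_{jn} = a_n.$$

We now begin with the identity $1 = \sum_{k=1}^{n} (-1)^{k-1} \binom{n}{k}$. Applying (1) twice yields

$$\sum_{k=1}^{n} \frac{1}{k} \sum_{j=1}^{k} \frac{1}{j} = \sum_{k=1}^{n} \frac{(-1)^{k-1}}{k^2} \binom{n}{k}.$$

Applying (2) now yields

$$\frac{(-1)^{n-1}}{n^2} = \sum_{k=1}^{n} (-1)^{n-k} \binom{n}{k} \sum_{j=1}^{k} \frac{1}{j} \sum_{i=1}^{j} \frac{1}{i}.$$

Dividing by $(-1)^{n-1}$ and applying (1) once more yields the desired identity.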


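Solution III by O. P. Lossers, University of Technology, Eindhoven, The Netherlands. Let

$$f(x) = \sum_{k=1}^{n} (-1)^{k-1} \binom{n}{k} \frac{1}{k} \sum_{j=1}^{k} \frac{1}{j} \sum_{i=1}^{j} \frac{(1-x)^i}{i}.$$

We prove that $f(x) - f(0) = -\sum_{k=1}^{n} x^k/k^3$. Since f(1) = 0, this yields the desired
identity $f(0) = \sum_{k=1}^{n} 1/k^3$.
Let A be the transformation taking p(x) to xp'(x). Direct computation shows that
$A^3\left(-\sum_{k=1}^{n} x^k/k^3\right) = -\sum_{k=1}^{n} x^k$. Since A is injective on the space of polynomials with
constant term 0, it suffices to show that $A^3(f(x) - f(0)) = -\sum_{k=1}^{n} x^k$ as well.
The linearity of A yields $A^3(f(x) - f(0)) = A^3(f(x))$. Now we compute directly

$$\begin{aligned}
A^3(f(x)) &= A^2\left(-\sum_{k=1}^{n} (-1)^{k-1} \binom{n}{k} \frac{1}{k} \sum_{j=1}^{k} \frac{1 - (1-x)^j}{j}\right) \\
&= A\left(-\sum_{k=1}^{n} (-1)^{k-1} \binom{n}{k} \frac{1 - (1-x)^k}{k}\right) \\
&= -\sum_{k=1}^{n} (-1)^{k-1} \binom{n}{k}\, x (1-x)^{k-1} = -\frac{x(1 - x^n)}{1-x} = -\sum_{k=1}^{n} x^k.
\end{aligned}$$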


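Solution IV by Victor Hernandez, Universidad Nacional de Educación a Distancia, Madrid,
Spain. The desired identity is the case m = 3 of the more general formula

$$\sum_{1 \le i_1 \le \cdots \le i_m \le n} \frac{(-1)^{i_m - 1}}{i_1 \cdots i_m} \binom{n}{i_m} = \sum_{k=1}^{n} \frac{1}{k^m}.$$

Consider two probabilistic processes for partitioning the unit interval [0, 1] into m + 1
subintervals. Let $X_0 = 1$ and $Y_0 = 0$. Choose $X_{i+1}$ uniformly at random in $[0, X_i]$, and
choose $Y_{i+1}$ uniformly at random in $[Y_i, 1]$. By symmetry, $X_m$ and $1 - Y_m$ have the same
distribution. We compute

$$E(X_m^{k-1}) = \int_0^1 dx_1 \int_0^{x_1} \frac{dx_2}{x_1} \cdots \int_0^{x_{m-1}} \frac{x_m^{k-1}\, dx_m}{x_{m-1}} = \frac{1}{k^m} \quad\text{and}$$

$$E(Y_m^{k-1}) = \int_0^1 dy_1 \int_{y_1}^1 \frac{dy_2}{1 - y_1} \cdots \int_{y_{m-1}}^1 \frac{y_m^{k-1}\, dy_m}{1 - y_{m-1}} = \sum_{1 \le i_1 \le \cdots \le i_m = k} \frac{1}{i_1 \cdots i_m}.$$

Thus

$$\begin{aligned}
\sum_{k=1}^{n} \frac{1}{k^m} &= E\left(\sum_{k=1}^{n} X_m^{k-1}\right) = E\left(\frac{1 - X_m^n}{1 - X_m}\right) = E\left(\frac{1 - (1 - Y_m)^n}{Y_m}\right) \\
&= E\left(\sum_{k=1}^{n} (-1)^{k-1} \binom{n}{k} Y_m^{k-1}\right) = \sum_{1 \le i_1 \le \cdots \le i_m \le n} \frac{(-1)^{i_m - 1}}{i_1 \cdots i_m} \binom{n}{i_m},
\end{aligned}$$

where we have set $i_m = k$.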

Editorial comment. Other solvers used generating functions, inductive arguments, and
various identities and transformations. Joe Howard and Heinz-Jürgen Seiffert independently
observed that the result follows fairly quickly from a result proved by the proposer in his
solution to Problem 4427, School Science and Mathematics 95 (1995) 221.


Solved also by E. S. Andersen & M. E. Larsen (Denmark), J. C. Binz (Switzerland), P. Bracken, D. Bradley (Canada), E. Braune 
(Austria), D. Callan, R. J. Chapman (U. K.), D. A. Darling, V. Dwivedi (India), E. Hertz, M. Hoffman, J. Howard, A. Kaplan 
(France), A. A. Kelzon (Russia), R. A. Kopas, O. Krafft & M. Schaefer (Germany), J. H. Lindsey II, J. Lorch, P. McCartney, D. K. 
Nester, A. Pechtl (Germany), F. Qi (China), E. Schmeichel, L. Scribani (South Africa), H.-J. Seiffert (Germany), A. Sinefakopoulos 
(Greece), I. Sofair, R. Sprugnoli (Italy), A. Stadler (Switzerland), A. Stenger, J. Van hamme (Belgium), M. Vowe (Switzerland), 
Z. Wu, GCHQ Problems Group (U. K.), WMC Problems group, and the proposer. 


A Limit of Periods 


10603 [1997, 567]. Proposed by Yury J. Ionin and Robin R. Lewis, Central Michigan
University, Mt. Pleasant, MI. Let a, b, and k be positive integers, and let $P_k(a, b)$ be the
period of the sequence $\{a^n \bmod b^k\}_{n=1}^{\infty}$. Find $\lim_{k\to\infty} P_{k+1}(a, b)/P_k(a, b)$.


Solution by the proposers. The limit equals the largest divisor of b that is relatively prime
to a.

Suppose first that a and b are relatively prime. Fixing a and b, let $P_k = P_k(a, b)$. We have
$a^{P_k} \equiv 1 \bmod b^k$, so $a^{P_k} = 1 + q_k b^k$ for some integer $q_k$. Note that $P_k$ divides $P_{k+1}$. Thus
$P_{k+1} = u_k P_k$, where $u_k$ is the smallest positive integer u such that $a^{uP_k} \equiv 1 \bmod b^{k+1}$.
Since

$$a^{uP_k} = (1 + q_k b^k)^u \equiv (1 + u q_k b^k) \bmod b^{k+1},$$

we have $u_k = b/d_k$, where $d_k = \gcd(q_k, b)$. Thus $P_{k+1} = bP_k/d_k$.

If k ≥ 2, then the equalities $a^{P_{k+1}} = 1 + q_{k+1} b^{k+1} = (1 + q_k b^k)^{b/d_k}$ imply that
$q_{k+1} = (q_k/d_k) + t_k b$, where $t_k$ is an integer. Therefore, if p is a common prime divisor
of b and $q_k$, then p occurs in the prime factorization of $q_{k+1}$ with an exponent smaller than
its exponent in the prime factorization of $q_k$. If p is a prime divisor of b that does not divide
$q_k$, then p also does not divide $q_{k+1}$. If k is sufficiently large, this implies that $q_k$ and b are
relatively prime, so $d_k = 1$ and $P_{k+1} = bP_k$. It follows that the desired limit is b.

Suppose now that $a$ and $b$ are not relatively prime. Let $r$ be the largest divisor of $b$
that is relatively prime to $a$, and let $b = rs$. Now $P_k(a, b)$ is a period of the sequence
$\{a^n \bmod r^k\}_{n=1}^{\infty}$, and thus $P_k(a, r)$ divides $P_k(a, b)$.

On the other hand, consider the expression $a^{P_k(a,r)+m} - a^m$. This is divisible by $r^k$, and
for large $m$ it is divisible by $s^k$. Since $\gcd(r, s) = 1$, it must therefore be divisible by $b^k$,
and we discover that $P_k(a, r)$ is a period of the sequence $\{a^n \bmod b^k\}_{n=1}^{\infty}$. Hence $P_k(a, b)$
divides $P_k(a, r)$. We conclude that $P_k(a, b) = P_k(a, r)$. Since $a$ and $r$ are relatively prime,
the claim follows from the previous case.
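
The stated limit can be checked numerically for small cases. The following is a minimal sketch, not part of the published solution: the helper names `period` and `largest_coprime_divisor` are hypothetical, and "period" is taken to mean the eventual period of the sequence, since the sequence need not be purely periodic when $a$ and $b$ share a factor.

```python
from math import gcd

def period(a, m):
    """Eventual period of the sequence a^n mod m for n = 1, 2, 3, ..."""
    seen = {}            # residue -> first index n at which it appeared
    x = a % m
    n = 1
    while x not in seen:
        seen[x] = n
        x = (x * a) % m
        n += 1
    return n - seen[x]   # distance between the two occurrences of x

def largest_coprime_divisor(b, a):
    """Largest divisor of b that is relatively prime to a."""
    r = b
    g = gcd(r, a)
    while g > 1:         # strip every prime shared with a
        r //= g
        g = gcd(r, a)
    return r

# Coprime case a = 2, b = 3: successive periods are multiplied by b.
print([period(2, 3 ** k) for k in (1, 2, 3)])
# Non-coprime case a = 2, b = 6: the ratio is r = 3, not b = 6.
print(period(2, 6 ** 3) // period(2, 6 ** 2), largest_coprime_divisor(6, 2))
```

Brute-force iteration is adequate only for small moduli; it is meant to illustrate the statement, not to compute periods efficiently.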


Solved also by D. A. Callan, R. J. Chapman (U. K.), J. H. Lindsey II, A. N. ’t Woord (The Netherlands), GCHQ Problems Group, 
and NCCU Problems Group. 


Avoiding the Identity 


10606 [1997, 664]. Proposed by Thomas Zaslavsky, Binghamton University, Binghamton, 
NY. Given a positive integer $m$, show that there is a positive integer $n$ such that, for every
group $G$ of order at least $n$, it is possible to choose $m$ elements $g_1, g_2, \ldots, g_m$ so that no
product of the form $g_{i_1} g_{i_2} \cdots g_{i_k}$ with $1 \le k \le m$ and distinct subscripts $i_1, i_2, \ldots, i_k$ in
$\{1, 2, \ldots, m\}$ equals the identity.


Solution by Stephen M. Gagola, Jr, Kent State University, Kent, OH. Let a signed product 
be a product of distinct group elements or their inverses in a specified order. A set S is 
admissible if the identity is not expressible as a signed product of elements of S. We prove 
that every group of order at least $f(m) = \lceil 2^{m-1}(m-1)!\sqrt{e}\, \rceil$ has an admissible subset of
size $m$.

For $S \subseteq G$, let $S^*$ be the set of signed products of elements of $S$. Let $g(m) = f(m+1) - 2$.
When $|S| = m$, we claim that $|S^*| \le g(m)$. A signed product is formed by choosing an
ordered nonempty subset of $S$ with exponents $\pm 1$. Thus $|S^*| \le \sum_{k=1}^{m} \binom{m}{k} k!\, 2^k$. We can
rewrite the bound as $2^m m!\, T_{m-1}(1/2)$, where $T_{m-1}(x) = \sum_{j=0}^{m-1} x^j/j!$ is the Maclaurin
polynomial of degree $m - 1$ for $e^x$. The next term in the series expansion of $2^m m!\, e^{1/2}$
contributes 1, while the remainder after that is at most 1. Thus $|S^*| \le f(m+1) - 2$.
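
The counting bound can be compared with $f(m+1) - 2$ for small $m$. This is an illustrative sketch only: it assumes $f$ is the ceiling of $2^{m-1}(m-1)!\sqrt{e}$ (the reading consistent with $f(1) = 2$), uses floating-point arithmetic that is reliable only for small $m$, and the function names are hypothetical.

```python
from math import ceil, comb, e, factorial, sqrt

def signed_product_bound(m):
    """Number of ways to pick an ordered nonempty subset of m elements with signs."""
    return sum(comb(m, k) * factorial(k) * 2 ** k for k in range(1, m + 1))

def f(m):
    """Assumed reading of the solution's f: ceiling of 2^(m-1) (m-1)! sqrt(e)."""
    return ceil(2 ** (m - 1) * factorial(m - 1) * sqrt(e))

for m in range(1, 9):
    # The two columns agree for these m, matching |S*| <= f(m+1) - 2.
    print(m, signed_product_bound(m), f(m + 1) - 2)
```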

Note that S* and G — S* are closed under taking inverses. If a signed product equals the 
identity, then each of its elements can be expressed as a signed product of the other elements 
in the product. If S is admissible and x is a nonidentity element of G — S*, it therefore 
follows that S U {x} is also admissible. Thus S can be enlarged until G — S* contains only 
the identity element. 

We now use induction on m to prove the claim that every group of order at least f(m) 
has an admissible subset of size m. When m = 1, every nontrivial group has a nonidentity 
element, and this forms an admissible set of size 1. This agrees with f(1) = 2. When 
m > 1, the monotonicity of f and the induction hypothesis imply that every group of order 
at least f(m) has an admissible subset S of size m — 1. Since g(m — 1) = f(m) — 2, there 
is a nonidentity element in G — S*, and S can be enlarged by one element. 
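
The enlargement step in the induction can be illustrated in a concrete group. The sketch below assumes the cyclic group $\mathbb{Z}_n$ written additively, so that a signed product becomes a signed sum and the order of the factors is irrelevant; the function names are hypothetical and the search is brute force, suitable only for small $n$ and $m$.

```python
from itertools import combinations, product

def signed_sums(S, n):
    """The set S* in Z_n: all signed sums of nonempty subsets of S."""
    out = set()
    for k in range(1, len(S) + 1):
        for sub in combinations(S, k):
            for signs in product((1, -1), repeat=k):
                out.add(sum(s * x for s, x in zip(signs, sub)) % n)
    return out

def greedy_admissible(n, m):
    """Grow an admissible set in Z_n by adjoining a nonidentity element outside S*."""
    S = []
    while len(S) < m:
        star = signed_sums(S, n)
        candidates = [x for x in range(1, n) if x not in star]
        if not candidates:
            return None          # G - S* contains only the identity
        S.append(candidates[0])
    return S

# Under the ceiling reading of f, f(3) = 14, so order 14 should admit size 3.
S = greedy_admissible(14, 3)
print(S, 0 not in signed_sums(S, 14))
```

Always taking the smallest candidate is an arbitrary choice; the argument in the solution allows any nonidentity element outside $S^*$.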


Solved also by P. J. Anderson (Canada), R. Barbara (France), D. Beckwith, G. L. Body (U. K.), R. J. Chapman (U. K.), J. H. 
Lindsey II, S. C. Locke, J. Merickel, K. A. Ross, M. C. Slattery, GCHQ Problems Group (U. K.), NCCU Problems Group, NSA 
Problems Group, and the proposer. 


Some Sums Require Care 


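10638 [1998, 69]. Proposed by Brian Conolly, Cambridge, U. K. For $0 < \lambda < 1$ and
$m \ge 0$, let $S_m(\lambda) = \sum_{n \ge 1} e^{-\lambda n}(\lambda n)^{n-m}/n!$. Show that $S_0(\lambda) = \lambda/(1 - \lambda)$, $S_1(\lambda) = 1$,
$S_2(\lambda) = 1/\lambda - 1/2$, and $S_3(\lambda) = 1/\lambda^2 - 3/(4\lambda) + 1/6$.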


Solution I by Allen Stenger, Tustin, CA. Let

$$T_m(\lambda) = \lambda^m S_m(\lambda) = \sum_{n \ge 1} \frac{(\lambda e^{-\lambda})^n n^{n-m}}{n!}.$$


By Stirling's formula the summand is asymptotic to $(2\pi)^{-1/2} n^{-m-1/2} (\lambda e^{1-\lambda})^n$. Thus, this
sum converges absolutely for $|\lambda e^{1-\lambda}| < 1$. Therefore it represents a continuous function
on $[0, 1)$. Furthermore, if $m \ge 1$, it converges uniformly for $|\lambda e^{1-\lambda}| \le 1$ and therefore is
continuous on $[0, 1]$. For $m = 0$ it diverges at $\lambda = 1$.

First consider the case $m = 1$. We want to show that $\sum_{n \ge 1} (\lambda e^{-\lambda})^n n^{n-1}/n! = \lambda$. Euler
showed that this holds for $0 \le \lambda < 1$. It can be derived by applying the Lagrange inversion
formula to $\lambda e^{-\lambda}$ (G. Pólya and G. Szegő, Problems and Theorems in Analysis, Volume 1,
Springer, 1972, Part 3, Exercise 209).

The formulas for other values of m can be derived from the case m = 1 by integration 
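or differentiation. Observe that $T_m(0) = 0$ and that by uniform convergence

$$\frac{d}{d\lambda} T_m(\lambda) = \frac{d}{d\lambda} \sum_{n \ge 1} \frac{e^{-n\lambda} \lambda^n n^{n-m}}{n!}
= \sum_{n \ge 1} \frac{\lambda^{n-1} e^{-n\lambda} n^{n-(m-1)}}{n!} - \sum_{n \ge 1} \frac{\lambda^{n} e^{-n\lambda} n^{n-(m-1)}}{n!}
= \left(\frac{1}{\lambda} - 1\right) T_{m-1}(\lambda),$$

for $\lambda \in [0, 1)$. Thus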


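$$T_0(\lambda) = \frac{\lambda}{1 - \lambda}\, T_1'(\lambda) = \frac{\lambda}{1 - \lambda},$$

$$T_2(\lambda) = \int_0^{\lambda} \frac{1 - z}{z}\, T_1(z)\, dz = \lambda - \frac{\lambda^2}{2},$$

$$T_3(\lambda) = \int_0^{\lambda} \frac{1 - z}{z}\, T_2(z)\, dz = \lambda - \frac{3}{4}\lambda^2 + \frac{1}{6}\lambda^3$$

for $0 \le \lambda < 1$. Continuity extends the latter two formulas to $\lambda = 1$.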
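
The four closed forms can be sanity-checked by summing the defining series directly. This is a rough numerical sketch, not part of either published solution: the function name `S` and the truncation at 500 terms are assumptions, and each term is evaluated through logarithms to avoid overflow in $n^{n-m}/n!$.

```python
from math import exp, lgamma, log

def S(m, lam, terms=500):
    """Partial sum of S_m(lam) = sum_{n>=1} e^{-lam n} (lam n)^{n-m} / n!."""
    total = 0.0
    for n in range(1, terms + 1):
        # log of the n-th term; lgamma(n + 1) = log(n!)
        total += exp(-lam * n + (n - m) * log(lam * n) - lgamma(n + 1))
    return total

lam = 0.5
print(S(0, lam), lam / (1 - lam))
print(S(1, lam), 1.0)
print(S(2, lam), 1 / lam - 1 / 2)
print(S(3, lam), 1 / lam ** 2 - 3 / (4 * lam) + 1 / 6)
```

The truncation is safe for $\lambda$ well inside $(0, 1)$, where the terms decay geometrically; near $\lambda = 1$ the decay is only polynomial and many more terms would be needed.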


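Solution II by Thomas Hermann, SDRC, Milford, OH. Replacing $e^{-\lambda n}$ by its Taylor series
and rearranging the sum formally gives

$$S_m(\lambda) = \sum_{n=1}^{\infty} \sum_{k=0}^{\infty} \frac{(-\lambda n)^k}{k!} \frac{(\lambda n)^{n-m}}{n!}
= \sum_{r=1}^{\infty} \frac{(-1)^r \lambda^{r-m}}{r!} \sum_{n=1}^{r} (-1)^n \binom{r}{n} n^{r-m}. \qquad (*)$$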


This reordering of the sum is justified as follows. By the uniform convergence proved in
Solution I, the first sum in (*) represents an analytic function on the domain $\{\lambda : |\lambda e^{1-\lambda}| <
1\}$. For sufficiently small $|\lambda|$, this sum converges absolutely. Thus, the reordering is valid
for $|\lambda|$ small. Hence the second sum in (*) is the Laurent series for the analytic function
given by the first sum in some punctured neighborhood of 0. We show that this Laurent
series converges for $0 < |\lambda| < 1$ if $m = 0$ and for $\lambda \ne 0$ if $m \ge 1$. Therefore these two
analytic functions agree on the connected domain $\{\lambda : |\lambda e^{1-\lambda}| < 1 \text{ and } 0 < |\lambda| < 1\}$ and
on its boundary if $m \ge 1$. Thus, the two sides of (*) agree for $\lambda \in [0, 1)$, and if $m \ge 1$ for
$\lambda \in [0, 1]$.

The inner sum on the right side of (*) is found in I. S. Gradshteyn and I. M. Ryzhik,
Table of Integrals, Series, and Products, Academic Press, 1994, equations 0.154 (3) and
(4). It can also be derived by evaluating at 0 the $(r - m)$th derivative of $(e^t - 1)^r$ in two
different ways. One obtains


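$$\sum_{n=1}^{r} (-1)^n \binom{r}{n} n^{r-m} = \begin{cases} 0 & \text{if } 1 \le m < r; \\ (-1)^r r! & \text{if } m = 0. \end{cases}$$

Therefore

$$S_0(\lambda) = \sum_{r=1}^{\infty} \frac{(-1)^r \lambda^r}{r!} (-1)^r r! = \frac{\lambda}{1 - \lambda},$$

which holds for $|\lambda| < 1$, and for $m \ge 1$,

$$S_m(\lambda) = \sum_{r=1}^{m} \left( \sum_{n=1}^{r} (-1)^n \binom{r}{n} n^{r-m} \right) \frac{(-1)^r}{r!} \left( \frac{1}{\lambda} \right)^{m-r},$$

which holds for all $\lambda \ne 0$.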
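
The finite sum for $m \ge 1$ can be evaluated exactly to recover the coefficients in the problem statement. The sketch below is an illustration under an assumed sign convention, with the factor $(-1)^r/r!$ outside and $(-1)^n$ inside the inner sum; the function name `laurent_coeffs` is hypothetical.

```python
from fractions import Fraction
from math import comb, factorial

def laurent_coeffs(m):
    """Map exponent j -> coefficient of (1/lambda)^j in S_m(lambda), for m >= 1."""
    coeffs = {}
    for r in range(1, m + 1):
        # inner sum over n of (-1)^n C(r, n) n^(r - m), kept as exact fractions
        inner = sum(Fraction((-1) ** n * comb(r, n)) * Fraction(n) ** (r - m)
                    for n in range(1, r + 1))
        coeffs[m - r] = Fraction((-1) ** r, factorial(r)) * inner
    return coeffs

print(laurent_coeffs(2))  # coefficients of 1/lambda and the constant term
print(laurent_coeffs(3))  # coefficients of 1/lambda^2, 1/lambda, and the constant term
```

Exact rational arithmetic is used so that the output can be compared directly with $1/\lambda - 1/2$ and $1/\lambda^2 - 3/(4\lambda) + 1/6$.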


Editorial comment. The proposer notes that the problem of showing $S_1(1) = 1$ was posed
by P. J. Cameron in connection with a result of Rényi concerning random forests of rooted
trees. In queuing theory, $S_1(\lambda)$ is the probability that a busy period will ever end in the
process M/D/1 with traffic intensity $\lambda$. Since $S_1(\lambda) < 1$ for $\lambda > 1$, a busy period for that
process has nonzero probability of continuing forever. When $0 < \lambda < 1$, the mean number
of customers served during a busy period is $S_0(\lambda)/\lambda$.


Solved also by P. Bracken (Canada), D. Callan, R. J. Chapman (U. K.), G. L. Isaacs, J. H. Lindsey II, O. P. Lossers (The 
Netherlands), V. Lucic (Canada), R. Mortini (France), V. Schindler (Germany), H.-J. Seiffert (Germany), A. Sofo (Australia), A. 
Stadler (Switzerland), A. Tissier (France), T. V. Trif (Romania), J. Van hamme (Belgium), M. Vowe (Switzerland), WMC Problems 
Group, and the proposer.




REVIEWS 


Edited by Harold P. Boas 
Mathematics Department, Texas A & M University, College Station, TX 77843-3368


An Imaginary Tale: The Story of $\sqrt{-1}$. By Paul J. Nahin. Princeton University Press,
1998, xvi + 257 pp., $24.95. 


Reviewed by Ricardo Diaz 


An Imaginary Tale is a leisurely, entertaining, idiosyncratic account of the develop- 
ment of the complex number system that can be enjoyed both by novices who know 
very little about complex numbers and by experts who think they know almost 
everything about them. The author, a professor of electrical engineering who also 
writes science fiction, has a special talent for expressing his infatuation with this 
fascinating topic. This accomplished raconteur has artfully avoided a direct chrono- 
logical presentation of his story. He flouts the tedious conventions of linear time to 
successfully weave a complicated tapestry that combines historical anecdotes, 
doggerel, mathematical puzzles, fascinating calculations, the occasional theorem, 
and even samples of Cauchy’s love letters. 

In the first half of the book, you will learn how close the Babylonians came to 
inventing i, discover the name of the Norwegian surveyor who gave complex 
arithmetic its first geometrical description (a decade before Argand), and marvel at 
the discomfort many distinguished mathematicians (including Leibniz, Boole, and 
Airy) suffered in the presence of the imaginary. If you proceed to the second half, 
you will be exposed to a crash course in contour integration and the Cauchy-Rie- 
mann equations, plus a sampling of applications in engineering and pure science. 
You will also see some beautiful gems from pure mathematics, such as Euler’s 
heuristic argument for expressing the sine function as an infinite product, culminating
in his derivation of the identity $\pi^2/6 = \sum_{n=1}^{\infty} n^{-2}$. As a bonus, you will find
in the appendices a discussion of how $i^i$ was once computed to 135 decimals in
1921 using tables of logarithms.

Of course this book is not a textbook, nor a scholarly treatise on the history of 
mathematics, and we must not judge it by pedantic standards. The aim of this book 
is far more ambitious than that of a textbook intended to instruct: this book was 
written to inspire and delight. Mathematics instructors at the college level will 
savor the amusing anecdotes and pass on choice items to colleagues and students. 
Eager undergraduates will enjoy learning that great mathematicians struggled over 
ideas that are now taken as obvious, and bright high school students will discover 
the adventures that await them if they persevere in their mathematical education. 

The enthusiastic author sometimes ascends to heights that most of his intended 
audience will be unable to follow, as in his discussion of the functional equation 
for the Riemann zeta function. Setting such passages aside, there is still a 
tremendous quantity of simple but elegant mathematics to enthrall, amuse, edify, 
surprise, and enchant even the most jaded of mathematical sophisticates. This is
indeed a rare book: one that lives up to the self-praise that adorns its jacket.
Professor Nahin has delivered the real goods. 

The reader who wishes to explore the history of the complex number system in 
greater depth may also wish to consult another informative, albeit less whimsical, 
book: Numbers, Heinz-Dieter Ebbinghaus et al., Springer, 1991. 


University of Northern Colorado, Greeley, CO 80639 
rdiaz@bentley.unco.edu 


Leaning Towards Infinity. By Sue Woolfe. Faber and Faber, Boston, 1997 (originally 
published by Random House Australia, 1996), xxi + 393 pp., $24.95 hardcover, $14.95 
softcover. 


Reviewed by John Beebee and Karen Willmore 


Are you curious about why men outnumber women in your advanced mathematics 
classes? Or are you interested in what a gifted writer who claims to know nothing 
of mathematics thinks about the feelings and motivations of its practitioners? If so, 
then you will find this a tempting novel, as we did. Its premise is that the search for 
truth in life, love, and mathematics is a messy and risky business. 


I think it all began because of the shape of my mother’s breasts. And it 
definitely began with something my mother wrote on the margin of a page 
stuck on the wall: Frege said that the line connecting any two points is 
already there before we draw it. (p. 3) 


What is this “it” described by Frances Montrose, a gifted amateur mathematician,
damaged daughter of another gifted amateur, Juanita, and mother of her
biographer, Hypatia? (And what if the Bernoullis had been mothers and daughters 
instead of fathers and sons?) Frances has just won a competition for amateur 
mathematicians and has the opportunity to present the joint ideas of herself and 
her mother at a conference in Athens, Greece, the birthplace of modern mathe- 
matics. In another possible beginning, Hypatia summarizes the plot as the story of 
“an Australian woman who’d never had any formal mathematics training but who 
in 1995 carried across the world, inside a borrowed suitcase and amongst ball 
dresses, a bulging three hundred and fifteen pages of revolutionary theorems, and 
something else no one knew about—the beginnings of the discovery of a new kind 
of number” (p. xiii). 

The emotional core of the novel is interwoven around four generations of 
unresolved mother-daughter relationships. Beautiful Juanita, abandoned by her 
self-centered mother, is a mathematical genius who in unthinking rebellion marries 
a man who can’t count past ten. Her mathematics, compulsively pursued during 
stolen time, almost reaches a breakthrough. She pins her love and her mathemati- 
cal dreams on her beautiful son, who has no use for either. Frances, her very plain 
daughter, is so consumed with trying to win her mother’s love and recognition by 
completing her mother’s mathematics that she neglects her own daughter, Hypatia 
—who despite her name, claims that she can barely add. Hypatia tries to break this
destructive cycle with her own daughter, Zoe, by writing down the story of her 
mother’s life. Is it possible, this novel asks, to be both a mother and a great 
person? For as Hypatia writes, motherhood is “about the allocation of the soul” 
(p. 216). 

In an interview on Wisconsin Public Radio (The Beauty of Math, 11/4/97), Sue 
Woolfe told of how she, a nonmathematician, came to use mathematics as a 
metaphor of who we are. She was living in an isolated place, nursing a baby, and 
listening to the ramblings of her neighbor, a mathematician dying of cancer. He 
talked not about his disease but about mathematics, and she came to realize that 
mathematics was as exotic and wonderful to him as her baby was to her. Woolfe 
went on to explain how she sees the connection between mathematics and love: 
both are full of assumptions that the mathematician and the lover accept on faith, 
with no guarantees of certainty. 

According to the root meaning of the word, amateurs are those who love what 
they do, and it is in this sense that Woolfe characterizes her very gifted female 
mathematicians as amateurs. In a family tree at the beginning of the novel, Juanita 
is labeled a “housewife and secret mathematician,” Frances an “English teacher 
and amateur mathematician,” and her one-time husband a “professional mathe- 
matician.” Frances’ husband would never discuss mathematics with her and left her 
as soon as their daughter was born. Woolfe portrays professional mathematicians 
mostly as conceited, ambitious, and rude (and male), more concerned with defend- 
ing the established profession against outsiders than with mathematical ideas. She 
comes down solidly on the side of “cranks,” those rare brilliant outsiders who are 
in love with mathematics. She even begins her novel (yes, another beginning) with 
the most famous crank story of all—the story of Srinivasa Ramanujan. 

Juanita once confesses to a young Frances, “I’m in love with thinking. I don’t
mean thinking things through or thinking things over or thinking things out.... 
Thinking itself.... It’s as abstract as death.... All the mess of life removed. And 
what’s left, it can go anywhere” (pp. 122-123). For Frances, mathematics is very 
much connected with her mother’s beauty: “She’s always been mathematics to me, 
beautiful beyond compare, unbearably elusive, unbearably unknowable” (p. 250). 
She says doing mathematics is like being “in an undiscovered country...and I’m 
like a blind person. There’s something up ahead of me, and I don’t know what it is. 
I’m feeling with my hands a bit sticking out here, and a bit going in there. Maybe 
over here is an arch. Maybe over there is a tunnel. It’s like something that might be 
a labyrinthine cathedral” (p. 46). 

Woolfe tells the story of these women’s lives and loves in anything but a 
straightforward, linear fashion. She warns us of her casual attitude toward time in 
the family tree, which ends with Juanita’s great grandson being born in 2003. 
Frances relates her experiences in Greece as occurring in the present, but then all 
through her story (which actually is being told by Hypatia as if she were her 
mother) she randomly digresses to recall her childhood, which in turn is inter- 
spersed with recalling her mother’s story. Hypatia, meanwhile, constantly inter- 
rupts all this retelling with her own asides, sometimes interpreting events, some- 
times telling her own story as Frances’ daughter, sometimes telling stories of 
famous mathematicians from the past (including the historical Hypatia). Initially, 
this is all very confusing for us as readers, and a certain amount of perseverance 
and faith is required before one begins to sort out just what is happening to whom 
and when—which may be not unlike doing mathematics. At any rate, Hypatia, in 
one of her digressions, gives this justification: “My mother’s mind was probably 
never one of those minds that travel in straight lines. Who knows? That may have
been the secret of her greatness....(Some might say that digression is to be 
expected of a mathematician, that mathematics is a digression)” (pp. 9-10). 

Before assigning Leaning Towards Infinity to our book club, composed of mostly 
academic types with a token physician, I (Karen) had John read it to make sure 
that the mathematics made sense. All but one book club member liked the novel 
very much; all agreed that it was confusing at first; one person disliked the 
metaphoric and image-laden language. Australians really like this book: it was the 
best-selling Australian novel in 1996. 

I (John) half anticipated a mixture of feminism and superficial impressions of
what mathematicians do, written in a painful and pretentious literary style. What I 
found was different. Although the prose was painful to me at first, it became more 
natural as I got drawn in and began to untangle the threads of the story. Woolfe 
has an unusually sensitive grasp of the process of mathematical creation, and I 
found her thinking about motherhood and mathematics to be uniquely her own. 
The mathematics conference in Athens—surreal, funny, and moving—should not 
be missed. 

THE COMPUTER ILLITERACY PROBLEM: A PARTIAL SOLUTION


PETER G. LYKOS 


1. How the computer is affecting education. The computer is transform- 
ing the ways in which the problem solvers and the decision makers of our 
society go about doing their jobs. Not only are old problem-solving 
techniques being speeded up and scaled up, but new techniques are being 
invented and developed which would not have been considered seriously 
before the invention and proliferation of the computer. And the prolifera- 
tion of the computer is being accelerated as a direct consequence of two 
technological developments:


1. The low cost and increasing flexibility of minicomputers, and 

2. The increasing flexibility and ease of use of tele-communications
systems whereby users can use a typewriter-like or keyboard plus TV-like 
terminal to access a variety of computers remotely located. 

Precalculus, T(13: 1), C. Explorations in College
Algebra. Linda Almgren Kime, Judy
Clark. Wiley, 1998, xviii + 649 pp, $73.95 (P), 
with CD ROM. [ISBN 0-471-10698-4] De- 
signed “to shift the focus from learning a set 
of discrete mathematical rules to exploring how 
algebra is used to answer questions about the 
physical and social world around us.” Similar 
in spirit to other reform texts; reflects NCTM 
and AMATYC standards. 


Precalculus, S(13). Maths: A Student’s Sur- 
vival Guide. Jenny Olive. Cambridge Univ 
Pr, 1998, xiii + 564 pp, $29.95 (P); $74.95. 
[ISBN 0-521-57586-9; 0-521-57306-8] Self- 
help workbook format; over 800 questions with 
detailed solutions. Topics range from basic al- 
gebra through single-variable calculus. Aimed 
at science and engineering students. 


Foundations, S(14-16: 1), S, P, L. Explain- 
ing Chaos. Peter Smith. Cambridge Univ 
Pr, 1998, viii + 193 pp, $59.95; $19.95 (P). 
[ISBN 0-521-471710; 0-521-477476] Philo- 
sophical treatment of chaos theory—its foun- 
dations, history, and modern role in science. 
Good, accessible explanations, nice examples, 
some rigorous mathematical descriptions. RM 


Combinatorics, P*, L. Handbook of Com- 
binatorial Optimization, Volumes 1-3. Eds: 
Ding-Zhu Du, Panos M. Pardalos. Kluwer Aca- 
demic, 1998, $1325 set, [ISBN 0-7923-5019-7] 
set. Volume 1, viii + 785 pp; Volume 2, viii + 
753 pp; Volume 3, viii + 865 pp. 33 expository 
articles survey algorithmic approaches to many 
discrete and combinatorial problems. Most ar- 
ticles are accessible to nonspecialists. 

Discrete Mathematics, T(13-14: 1), S, L*.
Mathematical Problems and Proofs: Com- 
binatorics, Number Theory, and Geometry. 
Branislav Kisačanin. Plenum Pr, 1998, xiv +
220 pp, $55. [ISBN 0-306-45967-1] Intro- 
duction to basic results and techniques. Empha- 
sizes interesting examples and problems. DB 


Number Theory, P. Random Matrices, Frobe- 
nius Eigenvalues, and Monodromy. Nicholas 
M. Katz, Peter Sarnak. Colloquium Public., 
V. 45. AMS, 1999, xi + 419 pp, $69. [ISBN 
0-8218-1017-0] There is empirical evidence 
that the distribution of the spacings between ze- 
roes of the zeta-function is the same as a certain 
probability measure from random matrix the- 
ory. This book establishes this relationship for 
several classes of zeta- and L-functions over 
finite fields. DB 


Algebra, P. Elimination Methods in Polyno- 
mial Computer Algebra. Valery Bykov, et al. 
Math. & Its Applic., V. 448. Kluwer Aca- 
demic, 1998, xi + 237 pp, $106. [ISBN 0-7923- 
5240-8] Methods based on multidimensional 
residue theory. Illustrates theory with applica- 
tions to mathematical kinetics. 


Algebra, P. Higher Category Theory. Eds: 
Ezra Getzler, Mikhail Kapranov. Contemp. 
Math., V. 230. AMS, 1998, x + 134 pp, $34 (P). 
[ISBN 0-8218-1056-1] Proceedings of a 1997 
workshop at Northwestern University. 


Algebra, P. Trends in the Representation The- 
ory of Finite Dimensional Algebras. Eds: Ed- 
ward L. Green, Birge Huisgen-Zimmermann. 
Contemp. Math., V. 229. AMS, 1998, xiii + 
356 pp, $75 (P). [ISBN 0-8218-0928-8] Proceedings
of the 1997 Joint Summer Research
Conference at the University of Washington. 


Calculus, S**(13), L. Calculus Mysteries and 
Thrillers. R. Grant Woods. MAA, 1998, xix 
+ 131 pp, $24.95 (P). [ISBN 0-88385-711-1] 
11 projects, each in the form of an amusing short 
story, designed to develop students’ modeling 
and technical writing skills. Appropriate for use 
in single-variable calculus courses. Includes a 
sample solution for each project. AO 


Calculus, T(13: 1). Workshop Calculus: 
Guided Exploration with Review, Volume 2. 
Nancy Baxter Hastings, et al. Springer-Verlag, 
1998, xxiv + 397 pp, $39.95 (P). [ISBN 0- 
387-98349-X] Second of a two-volume se- 
ries that integrates review of precalculus top- 
ics with standard first-semester calculus top- 
ics. Designed for use in a “workshop” class- 
room environment, each volume is a collection 
of guided exploration activities and homework 
exercises. (Volume 1, TR, October 1997.)


Calculus, T(14: 1). Vector Calculus. P.C. 
Matthews. Springer-Verlag, 1998, ix + 182 pp, 
$29.95 (P). [ISBN 3-540-76180-2] Concise 
introduction. Many references to topics in the 
physical sciences. Uses suffix notation; in- 
cludes a chapter on Cartesian tensors. Assumes 
some knowledge of linear algebra (matrices and 
determinants), and basic calculus (differenti- 
ation, integration, and partial differentiation). 
Includes solutions to all of the exercises. AO 


Calculus, T(13: 3). Calculus: A New Horizon, 
Sixth Edition. Howard Anton. Wiley, 1999, 
xxxii + 1314 pp, $94.95. [ISBN 0-471-15306-0]
Major revision. More emphasis on conceptual
understanding and applicability; some
reorganization of topics. (Fifth Edition, TR, 
October 1995.) 


Complex Analysis, T(18), S, P. The Loga- 
rithmic Integral I. Paul Koosis. Stud. in Adv. 
Math., V. 12. Cambridge Univ Pr, 1998, xviii + 
606 pp, $47.95 (P); $130. [ISBN 0-521-59672-6;
0-521-30906-9] Corrected paperback edition.
(First Edition, TR, May 1989.)


Differential Equations, P. Differential Equa- 
tions: La Pietra 1996. Eds: M. Giaquinta, J. 
Shatah, S.R.S. Varadhan. Proc. of Symp. in 
Pure Math., V. 65. AMS, 1999, xi + 219 pp, 
$39. [ISBN 0-8218-0610-6] Proceedings of a 
conference held to honor the 70th birthdays of 
Peter Lax and Louis Nirenberg. 


Differential Equations, T(14: 1). Differen- 
tial Equations: Graphics, Models, Data. David 
Lomen, David Lovelock. Wiley, 1999, xiv 
+ 682 pp, $96.95. [ISBN 0-471-07648-1] 
Covers standard set of topics using numerical,
graphical, analytical, and descriptive viewpoints.
Encourages an exploratory approach.
Motivates many topics using real-world prob- 
lems or data. AO 


Partial Differential Equations, P. Parametric 
Lie Group Actions on Global Generalised Solu- 
tions of Nonlinear PDEs. Elemér E. Rosinger. 
Math. & Its Applic., V. 452. Kluwer Academic, 
1998, xvii + 234 pp, $106. [ISBN 0-7923-5232- 
7] 


Partial Differential Equations, T(15: 1), L. 
Beginning Partial Differential Equations. Pe- 
ter V. O'Neil. Wiley, 1999, x + 500 pp, 
$79.95. [ISBN 0-471-23887-2] For a first 
course in PDEs. Topics: method of character- 
istics; classification of second-order equations; 
well-posedness; Fourier series; wave and heat 
equations; Dirichlet and Neumann problems. 
Some exercises require use of a CAS. PG 


Numerical Analysis, P. Parameter Estima- 
tion in Nonlinear Dynamical Systems. W.J.H. 
Stortelder. CWI Tract, V. 124. Centrum voor 
Wiskunde en Informatica, 1998, vi + 176 pp, 
Dfl. 40 (P). [ISBN 90-6196-482-2] Numeri- 
cal and statistical aspects of the parameter es- 
timation problem for dynamical systems de- 
scribed by differential algebraic equations. In- 
cludes several applied case studies. 


Numerical Analysis, S(16-17), P, L*. Ma- 
trix Algorithms, Volume I; Basic Decomposi- 
tions. G.W. Stewart. SIAM, 1998, xix + 
458 pp, $32 (P). [ISBN 0-89871-414-1] First 
of a planned 5-volume series for non-specialists. 
Emphasizes algorithms, their derivation, and 
their analysis. Initial chapters provide basic 
background on matrices, linear algebra, and the 
realities of matrix computations on computers. 
Subsequent chapters discuss LU, QR, and rank- 
reducing decompositions. AO 


Operator Theory, P. Operator Algebras and 
Operator Theory. Eds: Liming Ge, et al. Con- 
temp. Math., V. 228. AMS, 1998, xx + 389 pp, 
$85 (P). [ISBN 0-8218-1093-6] Proceedings 
of a 1997 conference in Shanghai, China. 


Operator Theory, P. Introduction to Vertex Operator
Superalgebras and Their Modules. Xiaoping
Xu. Math. & Its Applic., V. 456. Kluwer
Academic, 1998, xvi + 356 pp, $159. [ISBN
0-7923-5242-4]

Functional Analysis, P. Geometria de Es- 
pacios de Banach. José A. Facenda Aguirre. 
Universidad de Sevilla (Porvenir, 27—41013 
Sevilla, SPAIN), 1998, vi + 139 pp, (P). [ISBN 
84-472-0473-1] 

Analysis, S(18), P. Advanced Integration The- 
ory. Corneliu Constantinescu, et al. Math. & 
Its Applic., V. 454. Kluwer Academic, 1998, x
+ 861 pp, $375. [ISBN 0-7923-5234-3] This 
huge work aims to tie together theories used by 
probabilists and analysts. Vector lattice theory 
provides the framework for their study, including
$L^p$-spaces, real measures, absolute continuity.
Each section contains many exercises.
Good for self-study. KS 


Algebraic Geometry, P. Algebra and Geome- 
try. Ed: Ming-chang Kang, et al. International 
Pr, 1998, ix + 227 pp. [ISBN 1-57146-058-6] 
Proceedings of a 1995 conference at National 
Taiwan University. 


Differential Geometry, P. The Theory of Fins- 
lerian Laplacians and Applications. Eds: Peter 
L. Antonelli, Bradley C. Lackey. Math. & Its 
Applic., V. 459. Kluwer Academic, 1998, xxx 
+ 282 pp, $146. [ISBN 0-7923-5313-7] 


Differential Geometry, P. Harmonic Maps, 
Loop Groups, and Integrable Systems. Mar- 
tin A. Guest. London Math. Soc. Stud. Texts, 
V. 38. Cambridge Univ Pr, 1997, xiv + 194 pp, 
$21.95 (P); $59.95. [ISBN 0-521-58932-0; 0- 
521-58085-4] “The specific goal of this book 
is to show how the theory of loop groups can be 
used to study harmonic maps.” 


Differential Geometry, P. New Developments 
in Differential Geometry, Budapest 1996. Ed: 
J. Szenthe. Kluwer Academic, 1999, xii + 
519 pp, $227. [ISBN 0-7923-5307-2] 36 pa- 
pers based on presentations at the conference. 


Differential Geometry, P. Topics in Symplec- 
tic 4-Manifolds. Ed: Ronald J. Stern. Lect. 
Ser., V. 1. International Pr, 1998, iii + 124 pp.
[ISBN 1-57146-019-5] 6 papers based on in- 
vited lectures presented at a 1996 conference at 
the University of California at Irvine. 


Geometry, P. Monge Ampère Equation: Applications
to Geometry and Optimization. Eds:
Luis A. Caffarelli, Mario Milman. Contemp. 
Math., V. 226. AMS, 1999, ix + 172 pp, 
$39 (P). [ISBN 0-8218-0917-2] 10 invited 
lectures and contributed papers from a 1997 
NSF-—CBMS conference held at Florida Atlantic 
University. 


Topology, P. Continuous Selections of Multivalued
Mappings. Dušan Repovš, Pavel
Vladimirovič Semenov. Math. & Its Applic.,
V. 455. Kluwer Academic, 1998, viii + 356 pp, 
$159. [ISBN 0-7923-5277-7] 


Operations Research, P. Fuzzy Sets in Decision
Analysis, Operations Research and Statistics.
Ed: Roman Słowiński. Handbooks of
Fuzzy Sets Ser. Kluwer Academic, 1998, xxiv
+ 453 pp, $169.95. [ISBN 0-7923-8112-2]
13 chapters in four sections: Decision Making;
Mathematical Programming; Statistics and
Data Analysis; Reliability, Maintenance, and
Replacement.


Game Theory, T(18), S, P. Dynamic Non- 
cooperative Game Theory, Second Edition. 
Tamer Başar, Geert Jan Olsder. Classics in
Appl. Math., V. 23. SIAM, 1999, xv +
519 pp, $49.50 (P). [ISBN 0-89871-429-X]
Unabridged, revised republication of the 1995
Second Edition originally published by Academic
Press. This edition includes new material
on the Braess Paradox and on the relationship
between the existence of solutions of Riccati
equations and the existence of Nash equilibrium 
solutions. (First Edition, TR, April 1984.) 


Mathematical Modeling, T(15), L. Mathe- 
matical Analysis for Modeling. Judah Rosen- 
blatt, Stoughton Bell. Math. Modeling Ser. 
CRC Pr, 1999, 860 pp, $69.95. [ISBN 0- 
8493-8337-4] Presents mathematical frame- 
work for modeling problems in science and 
technology. Topics covered include Riemann 
and Lebesgue integration, Taylor’s theorem, in- 
finite series, multivariable calculus, coordinate 
systems, matrices, Fourier transforms, and gen- 
eralized functions. PG 


Optimal Control, P. Differential Geometry and 
Control. Eds: G. Ferreyra, et al. Proc. of Symp. 
in Pure Math., V. 64. AMS, 1999, viii + 341 pp,
$79. [ISBN 0-8218-0887-7] 20 papers from 
the 1997 AMS Summer Research Institute held 
at the University of Colorado, Boulder. 


Stochastic Processes, P.  Martingales et 
chaines de Markov. Laurent Mazliak, Pierre 
Priouret, Paolo Baldi. Hermann, 1998, viii + 
215 pp, 180 F(P). [ISBN 2-7056-6382-7] 


Stochastic Processes, P. Optimal Filtering, 
Volume I: Filtering of Stochastic Processes. 
Vladimir Fomin. Math. & Its Applic., V. 457. 
Kluwer Academic, 1999, xiii + 375 pp, $173. 
[ISBN 0-7923-5286-6] 


Stochastic Processes, P. Stochastic Partial Dif- 
ferential Equations: Six Perspectives. Eds: 
Rene A. Carmona, Boris Rozovskii. Math. 
Surv. & Mono., V. 64. AMS, 1999, xi + 334 pp,
$49. [ISBN 0-8218-0806-0] 6 survey articles;
each presents a different perspective on the pri- 
mary topic. 

Stochastic Processes, P. One-Dimensional 
Random Polymers. R.W. van der Hofstad. CWI 
Tract, V. 123. Centrum voor Wiskunde en In- 
formatica, 1998, 165 pp, Dfl. 40 (P). [ISBN 
90-6196-481-4] 

Statistical Methods, P. Maximum Entropy 
and Bayesian Methods. Eds: Gary J. Erick- 
son, Joshua T. Rychert, C. Ray Smith. Fund. 
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Theories of Physics, V. 98. Kluwer Academic, 
1998, ix + 297 pp, $133. [ISBN 0-7923-5047-2] 
Proceedings of the 17th International Workshop 
held at Boise State University in 1997. 


Algorithms, T(15-17: 1), S, P, L. Combinatorial 
Algorithms: Generation, Enumeration, 
and Search. Donald L. Kreher, Douglas R. 
Stinson. Disc. Math. & Its Applic. CRC Pr, 
1999, 329 pp, $74.95. [ISBN 0-8493-3988-X] 
Crisp modern treatment. Standard topics plus 
generation of combinatorial objects, groups and 
symmetry, computing isomorphisms, basis re- 
duction. Nice concrete examples. RM 


Computer Science, T(14-15: 1). Computer 
Systems. J. Stanley Warford. Jones & Bartlett, 
1999, xv + 506 pp. [ISBN 0-7637-0794-5] 


Computer Science, P. Reliable Computer Sys- 
tems: Design and Evaluation, Third Edition. 
Daniel P. Siewiorek, Robert S. Swarz. AK Pe- 
ters, 1998, xix + 908 pp, $65. [ISBN 1-56881- 
092-X] 

Applications (Biological Science), T(16-17: 
2), P, L*. Mathematical Physiology. James 
Keener, James Sneyd. Interdisc. Appl. Math., 
V. 8. Springer-Verlag, 1998, xix + 766 pp, 
$69.95. [ISBN 0-387-98381-3] An introduc- 
tory survey emphasizing continuous, determin- 
istic approaches. Illustrates how mathematics 
provides insight into physiological questions as 
well as how physiological questions can lead to 
new mathematical problems. First part presents 
the fundamentals of cellular physiology; sec- 
ond part discusses the physiology of systems. 
Accessible to mathematicians with little knowl- 
edge of physiology. AO 

Applications (Engineering), T(14-15: 4). Advanced 
Engineering Mathematics, Eighth Edition. 
Erwin Kreyszig. Wiley, 1999, xvi 
+ 1273 pp, $109.95. [ISBN 0-471-15496- 
2] New edition of this classic text/reference 
(Seventh Edition, TR, January 1995). Major 
changes: more emphasis on qualitative meth- 
ods and applications in problem sets; projects 
of various types (e.g., team, writing, CAS); up- 
dated chapters on numerical methods. AO 
Applications (Fluid Mechanics), P. Applica- 
tions of Group-Theoretical Methods in Hydro- 
dynamics. V.K. Andreev, et al. Math. & Its 
Applic., V. 450. Kluwer Academic, 1998, xii + 
396 pp, $205. [ISBN 0-7923-5215-7] 


Applications (Fluid Mechanics), P. Theory 
of Multicomponent Fluids. Donald A. Drew, 
Stephen L. Passman. Appl. Math. Sci., V. 135. 
Springer-Verlag, 1999, x + 308 pp, $59.95. 
[ISBN 0-387-98380-5] 
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Applications (Physics), T(16-17).  Classi- 
cal Electrodynamics. Walter Greiner. Theor. 
Physics. Springer-Verlag, 1998, x + 555 pp, 
$49.95 (P). [ISBN 0-387-94799-X] Careful 
presentation of classical electrostatics, magne- 
tostatics, and electrodynamics. No exercises, 
but many worked examples. AO 


Applications (Quantum Theory), P. Quan- 
tum Measures and Spaces. G. Kalmbach. 
Math. & Its Applic., V. 453. Kluwer Academic, 
1998, xi + 343 pp, $159. [ISBN 0-7923-5288- 
2] 


Applications (Quantum Theory), P. Dirac 
Operators and Spectral Geometry. Giampiero 
Esposito. Lect. Notes in Phys., V. 12. Cambridge 
Univ Pr, 1998, xiii + 209 pp, $34.95 (P). 
[ISBN 0-521-64862-9] 


Applications (Systems Theory), P. Stabil- 
ity of Finite and Infinite Dimensional Systems. 
Michael I. Gil’. Intern. Ser. in Engin. & Comp. 
Sci. Kluwer Academic, 1998, xviii + 358 pp, 
$145. [ISBN 0-7923-8221-8] 


Applications (Systems Theory), P. Lecture 
Notes in Control and Information Sciences— 
241: Learning, Control and Hybrid Systems. 
Eds: Yutaka Yamamoto, Shinji Hara. Springer- 
Verlag, 1999, xv + 451 pp, $92 (P). [ISBN 
1-85233-076-7] Proceedings of a 1998 work- 
shop in Bangalore, India, honoring B.A. Francis 
and M. Vidyasagar. 


Applications (Systems Theory), P. Lecture 
Notes in Control and Information Sciences— 
240: Low Gain Feedback. Zongli Lin. 
Springer-Verlag, 1999, xvii + 354 pp, $105 (P). 
[ISBN 1-85233-081-3] 


Applications (Systems Theory), P. Lecture 
Notes in Control and Information Sciences— 
238: Computational Methods for Controller 
Design. Nicola Elia, Munther A. Dahleh. 
Springer-Verlag, 1998, xi + 153 pp, $59 (P). 
[ISBN 1-85233-075-9] 


Applications (Systems Theory), P. Lecture 
Notes in Control and Information Sciences— 
239: Finite Spectrum Assignment for Time-Delay 
Systems. Qing-Guo Wang, Tong Heng 
Lee, Kok Kiong Tan. Springer-Verlag, 1999, 
viii + 117 pp, $45 (P). [ISBN 1-85233-065-1] 


Reviewers 


DB: David Bressoud, Macalester; PG: Philip Gloor, 
St. Olaf; RM: Richard Molnar, Macalester; AO: Arnold 
Ostebee, St. Olaf; KS: Karen Saxe, Macalester; PZ: Paul 
Zorn, St. Olaf. 
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upon by students, educators, 
engineers, and scientists around 
the world. It does for algebra, 
equations, trigonometry, vectors, 
matrices, and calculus what the 
scientific calculator does for 
numbers — it eliminates the 
drudgery of performing long and 
tedious mathematical 
calculations. You can easily 
solve both symbolic and numeric 
problems and see the results plotted as 
2D or 3D graphs. 


For everyday mathematical work 
DERIVE is a tireless, powerful, and 
knowledgeable assistant. For teaching 
or learning mathematics, DERIVE gives 


Soft Warehouse 
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Inc. Other trademarks are the property of their respective owners. 


for Windows 


Site Licenses and Student Pricing. 
See www.derive.com 
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"you the freedom to explore 
different mathematical approaches 
better and more quickly than by 
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System Requirements: 
Windows 95, 3.1x or NT running 
on a computer with 8 megabytes 
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Suggested Retail Price: $250. 
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For product information and list of 
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Packaged with a PC compatible disk that enhances the 
material in the text. 


Suitable for classroom adoption in an innovative course for 
• a general education mathematics elective 
• a mathematics or science major advanced elective 
• an interdisciplinary course, even at a relatively 
elementary level 
• a mathematical modeling course in a 
civil/environmental engineering program 


This book has a dual objective: first, to introduce the reader 
to some of the most important and widespread environmen- 
tal issues of the day; and second, to illustrate the vital role 

played by mathematical models in investigating these issues. 
The environmental issues addressed include: ground-water 
contamination, air pollution, and hazardous material emer- 
gencies. These issues are presented in their full real-world 
context, not as scientific or mathematical abstractions; and 
for background, readers are invited to investigate their status 
in their own communities. 

The first part of the book leads the reader through rela- 
tively elementary modeling of these phenomena, including 
simple algebraic equations for ground water, slightly more 
complex algebraic equations (preferably implemented on a 
spreadsheet or other computerized framework) for air pollu- 
tion, and a fully computerized modeling package for haz- 
ardous materials incident analysis. The interplay between 
physical intuition and mathematical analysis is emphasized. 

For more advanced readers, the second part of the book 
returns to the same three subjects but with a higher level of 
mathematical sophistication (adjustable to the preparation of 
the reader by selection of subsections). Many important classical 
mathematical themes are developed through this context, 
examples coming from single and multivariable calculus, 
differential equations, numerical analysis, linear algebra 
and probability. The material is presented in such a way as to 
minimize the required background and to encourage the 
subsequent study of some of these fields. 

An elementary course for a general audience could be 
based entirely on Part 1, and a higher level mathematics, science, 
or engineering course could move quickly to Part 2. 


THE MATHEMATICAL ASSOCIATION OF AMERICA 


Mathematical Modeling in the Environment 


Charles Hadlock 


Series: Classroom Resource Materials 

A PC compatible diskette packaged with the text con- 
tains a spreadsheet program that facilitates the numerical 
experimentation with the Gaussian plume equation introduced 
in Chapter 3, as well as a public domain DOS program 
(ARCHIE) for evaluating the consequences of various hazardous 
materials scenarios (e.g., the physical extent of flammable 
and toxic vapor clouds). The text is not tied to the use of 
this software, but it is included as an aid to meet the peda- 
gogical objectives of the text. 
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Contains the complete solutions and further discussion of 
nearly every exercise presented in the textbook. This includes 
both the mathematical/computational exercises as well as the 
research questions and investigations. Readers will benefit 
greatly from perusing solutions to the problems whether they 
have worked them out themselves or not. Students using this 
volume will still need to work out solutions of research questions 
using their own sources and adapting them to their own 
geographic locations, or using their own computational 
schemes, so this volume could well be useful for students in 
many course contexts. Enrichment material is included on 
the topics of some of the exercises. Advice for teachers who 
lack previous environmental experience, but who want to 
teach this material is also provided and makes it practical for 
such persons to offer a course based on these volumes. 
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A Course in Mathematical Modeling 


This book is intended as a text for a modeling 
course accessible to students who have mastered a 
one year course in calculus. It balances a variety of 
opposing modeling methodologies including theo- 
retical models versus empirical models, analytical 
models versus simulation, deterministic models 
versus stochastic models, and discrete models 
versus continuous models. Most of the examples 
are drawn from real-world data or from models that 
have been used in various applied fields. The use of 
computers in both simulation and in mathematical 
analysis is an integral part of the presentation. 


The authors emphasize the teaching of the model- 
ing process as opposed to merely presenting 
models. They begin their book with the simple dis- 
crete exponential growth model, and successively 
refine it to include variable growth rates, multiple 
variables, growth rates fitted to data, and the effects 
of random factors. The last part of the book moves 
into continuous-time models. Issues of model 
validity and purpose are emphasized throughout. 


Students taking a course based on this book should 
have some mathematical maturity, but will need 
little advanced knowledge. The book presents more 
advanced topics on an as-needed basis and serves 
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to show how the different topics of undergraduate 
mathematics can be used together to solve prob- 
lems. This perspective is valuable as either a road 
map for beginning students or as a capstone for 
more advanced students. The course presents 
elements of discrete dynamical systems, basic 
probability theory, differential equations, matrix 
algebra, stochastic processes, curve fitting, statistical 
testing, and regression analysis. Computer analysis 
is extensively used in conjunction with these topics. 


You can also use this book if you are seeking appli- 
cations to supplement a course in linear algebra, 
differential equations, difference equations, proba- 
bility theory, or statistics. 


Catalog Code: MMLJR 
400 pp., Paperbound, 1999 
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Visit the authors’ website at 
<www.wku.edu/~swiftrj/Modeling/modeling.html>, where you can 
download data sets, Mathematica files, 
and other modeling resources that execute the 
models described in the text. 
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This fascinating little book is eminently readable, and it 
is a great deal of fun to peruse... the book is a real treat. 
We need more books like this, crafted by expert hands yet 
crafted so that the general reader can enjoy them. 
—Bulletin of The Institute of Combinatorics and 
Its Applications 


This excellent and entertaining book is suitable for a first 
course in cryptology for mathematical enthusiasts. An 
abundance of exercises and an excellent list of related ref- 
erences are included. 

—The Mathematics Teacher 


In spite of the light-hearted style in which the book is 
written throughout, it is a serious—and successful— 
attempt to explain the basis of coding and decoding mes- 
sages...I can strongly recommend this book to anyone who 
wants a brief but comprehensive, eminently readable, and 
up-to-date introduction to this increasingly popular topic. 
— The Mathematical Gazette 


All of cryptology is covered in this work...Occupying a 
niche in the halls of the ivory tower of pure mathematics 
for nearly two millennia, number theory now forms a pil- 
lar of modern society. This book is the best explanation 
available today of how that pillar was constructed. 

— Charles Aschbacher 


A model to follow in order to make mathematics better 
known and understood. Accessible to a broad audience. 
Have fun reading this book, while you are getting a better 
understanding of cryptology. 

— Bulletin of the Belgian Mathematics Society 


How can messages be transmitted secretly? How 
can one guarantee that the message arrives safely 
in the right hands exactly as it was transmitted? 
Cryptology—the art and science of “secret writ- 
ing”—provides ideal methods to solve these prob- 
lems of data security. 


The book is fun to read, and the author presents 
the material clearly and simply. Many exercises 
and references accompany each chapter. 


176 pp., Paperbound, 1994 
ISBN 0-88385-504-6 

List: $34.00 

MAA Member: $26.95 
Catalog Code: CRYPT/JR 
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If you teach calculus, you should read this book. If you 
want to know what mathematics your students understand, 
or if you want to know how to find out what they under- 
stand, this book contains essential information for you. 


It doesn’t matter whether you teach a reform or traditional 
course, whether you have large or small sections, or 

whether you use lectures or laboratories. The bottom line 
is the same: When all is said and done, what counts is 
what our students understand. And that’s what Student 
Assessment in Calculus is about. 


Over the last ten years calculus instruction has changed 
in numerous ways. Whether they were trying on new 
ideas or following the more traditional routes towards 
conceptual understanding, both individual faculty and 
departments needed to know if their instruction was 
effective. To help deal with that issue, the National Science 
Foundation brought together a Working Group of experts 
in students’ mathematical thinking, in assessment, and in 
calculus reform. The goals of their work were to: 
• develop a framework to tailor calculus instruction to the 
students’ needs; 
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• establish an agenda for further research on student 
understanding; 

• describe how to make use of a range of techniques to 
test what students know, such as multiple-choice tests or 
short essay questions, student portfolios and “clinical” 
interviews; 

• summarize major goals of the reform movement and 
describe the challenges faced by those who are taking a 
closer look at how students learn; 

• illustrate the ways in which calculus projects attempt 
(via exams, papers, projects, etc.) to find out what their 
students have learned. 


This book is the result of those efforts. If you teach calcu- 
lus, if you want to see examples of useful assessment tech- 
niques, or if you are interested in issues of how to mea- 
sure student learning in mathematics, then there is a lot for 
you here. 
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Robert Ash 


A Primer of Abstract Mathematics prepares the reader 
to cope with abstract mathematics, specifically abstract 
algebra. It can serve as a text for prospective mathe- 
matics majors, as well as for those students taking or 
preparing to take a first course in abstract algebra, or 
those in applied fields who need experience in dealing 
with abstract mathematical ideas. 


Learning any area of abstract mathematics involves writ- 
ing formal proofs, but it is equally important to think 
intuitively about the subject and to express ideas clearly 
and cogently. The author aids intuition by keeping 
proofs short and as informal as possible, using concrete 
examples which illustrate all the features of the general 
case, and by giving heuristic arguments when a formal 
development would take too long. The text can serve as 
a model on how to write mathematics for an audience 
with limited experience in formalism and abstraction. 


Ash introduces several expository innovations in A 
Primer of Abstract Mathematics. He presents an entirely 
informal development of set theory that gives students 
the basic results that they will need in algebra. The 
chapter which presents the theory of linear operators 
introduces the Jordan Canonical Form right at the begin- 
ning, with a proof of existence at the end of the chapter. 
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Contents: 
Logic and Foundations: Truth Tables, Quantifiers, Proofs, 
Sets, Functions, Relations. 


Counting: Fundamentals, The Binomial and Multinomial 
Theorems, The Principle of Inclusion and Exclusion, 
Counting Infinite Sets. 


Elementary Number Theory: The Euclidean Algorithm, 
Unique Factorization, Algebraic Structures, Further 
Properties of Congruence Modulo m, Linear Diophantine 
Equations and Simultaneous Congruences, Theorems of 
Euler and Fermat, The Mobius Inversion Formula. 


Some Highly Informal Set Theory: Well-Orderings, Zorn’s 
Lemma and the Axiom of Choice, Cardinal Numbers, 
Addition and Multiplication of Cardinals. 


Linear Algebra: Matrices, Determinants and Inverses, The 
Vector Space F”, Linear Independence and Bases, Subspaces, 
Linear Transformations, Inner Product Spaces, Eigenvalues 
and Eigenvectors. 


Theory of Linear Operators: Jordan Canonical Form, The 
Minimal and Characteristic Polynomials, The Adjoint of a 
Linear Operator, Normal Operators, The Existence of the 
Jordan Canonical Form. 


Appendix: An Application of Linear Algebra. 
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Sixth Edition 


Series: Spectrum 


H. S. M. Coxeter’s classic book on non-Euclidean 
geometry was first published in 1942, and enjoyed 
eight reprintings before it went out of print in 1968. 
The MAA is delighted to be the publisher of the 
sixth edition of this wonderful book, updated with 
a new section 15.9 on the author's useful concept of 
inversive distance. 


Throughout most of this book, non-Euclidean 
geometries in spaces of two or three dimensions are 
treated as specializations of real projective geometry 
in terms of a simple set of axioms concerning points, 
lines, planes, incidence, order and continuity, with 
no mention of the measurement of distances or angles. 
This synthetic development is followed by the intro- 
duction of homogeneous coordinates, beginning 
with von Staudt’s idea of regarding points as entities 
that can be added or multiplied. Transformations 
that preserve incidence are called collineations. 
They lead in a natural way to elliptic isometries or 
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“congruent transformations”. Following a recom- 
mendation by Bertrand Russell, continuity is 
described in terms of order. Elliptic and hyperbolic 
geometries are derived from real projective geome- 
try by specializing an elliptic or hyperbolic polarity 
which transforms points into lines (in two dimen- 
sions) or planes (in three dimensions) and vice versa. 


An unusual feature of the book is its use of the 
general linear transformation of coordinates to derive 
the formulas of elliptic and hyperbolic trigonometry. 
The area of a triangle is related to the sum of its 
angles by means of an ingenious idea of Gauss. This 
treatment can be enjoyed by anyone who is familiar 
with algebra up to the elements of group theory. 
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A perfect gift for the new teacher. . . or for anyone inter- 
ested in the teaching of mathematics. 


This book is the legacy of twenty years of mathematics 
teaching. During this time, the author searched for moti- 
vation techniques, mnemonics, insightful proofs, and 
serious applications of humor to aid his teaching. The 
result is this book: part philosophy, part humor, and part 
biography. Readers will be amused and enlightened on 
every page. 


Mr. Stueben shows how he has used humor and word- 
play to motivate his students. The book is filled with 
wonderful problems and proofs, as well as the author's 
insights about how to approach teaching problem solving 
to high school students. Sections of the book also treat 
the use of calculators and computers in the classroom. A 
section on mnemonics shows how teachers can use mem- 
ory aids to help their students learn and retain material. 
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The Lessons and Humor of a Mathematics Teacher 


Michael Stueben with Diane Sandford 


All in all, Twenty Years Before the Blackboard provides a 
goldmine of ideas for the classroom teacher. Although Mr. 
Stueben taught at the high school level, his book is an excel- 
lent “methods” book for mathematics teachers at all levels. 


Read what Martin Gardner has to say about this fascinat- 
ing book: 


It’s been decades since I read so entertaining a book about 
mathematics. The book is a treasure-trove of mathematical 
jokes, rhymes, anecdotes, word play, mnemonics, and beautiful 
proofs. For teachers there is an abundance of wise advice 
based on the author’s twenty years in high school teaching. 
Mathematicians at all levels, from amateurs to college profes- 
sors will not only chuckle over its gems, but learn much they 
did not know before. —Martin Gardner 
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This book is based on the notes of a course in logic given by 
Paul Halmos. This book retains the spirit and purpose of 
those notes, which was to show that logic can (and perhaps 
should) be viewed from an algebraic perspective. When so 
viewed, many of its principal notions are seen to be old 
friends, familiar algebraic notions that were “disguised” in logi- 
cal clothing. Moreover, the connection between the principal 
theorems of the subject and well-known theorems in algebra 
becomes clearer. Even the proofs often gain in simplicity. 


Propositional logic and monadic predicate calculus—predicate 
logic with a single quantifier—are the principal topics treated. 
The connections between logic and algebra are carefully 
explained. The key notions and the fundamental theorems are 
elucidated from both a logical and algebraic perspective. The 
final section gives a unique and illuminating algebraic treatment 
of the theory of syllogisms—perhaps the oldest branch of logic, 
and a subject that is neglected in most modern logic texts. 


The presentation is aimed at a broad audience—mathematics 
amateurs, students, teachers, philosophers, linguists, computer 
scientists, engineers, and professional mathematicians. 
Whether the reader’s goal is a quick glimpse of modern logic or 
a more serious study of the subject, the book’s fresh approach 
will bring novel and illuminating insights to beginners and pro- 
fessionals alike. All that is required of the reader is an acquain- 
tance with some of the basic notions encountered in a first 
course in modern algebra. In particular, no prior knowledge of 
logic is assumed. The book could serve equally well as a fire- 
side companion and as a course text. 
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Contents: What is Logic?: To count or to think; A small 
alphabet; A small grammar; A small logic; What is truth?; 
Motivation of the small language; All mathematics. 
Propositional Calculus: Propositional symbols; 
Propositional abbreviations; Polish notation; Language as 
an algebra; Concatenation; Theorem schemata; Formal 
proofs; Entailment; Logical equivalence; Conjunction; 
Algebraic identities. Boolean Algebra: Equivalence class- 
es; Interpretations, Consistency and Boolean algebra; 
Duality and commutativity; Properties of Boolean alge- 
bras; Subtraction; Examples of Boolean algebras. 
Boolean Universal Algebra: Subalgebras; 
Homomorphisms; Examples of homomorphisms; Free 
algebras; Kernels and ideals; Maximal ideals; 
Homomorphism theorem; Consequences; The represen- 
tation theorem. Logic via Algebra: Pre-Boolean algebras; 
Substitution rule; Boolean logics; Algebra of the proposi- 
tional calculus; Algebra of proof and consequence. 
Lattices and Infinite Operations: Lattices; Non-distribu- 
tive lattices; Infinite operations. Monadic Predicate 
Calculus: Propositional functions; Finite functions; 
Functional monadic algebras; Functional quantifiers; 
Properties of quantifiers; Monadic algebras; Free monadic 
algebras; Modal logics; Monadic logics; Syllogisms. 
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Elementary Mathematical Models (EMM) claims a middle ground 
between college algebra and liberal arts mathematics. Like the college 
algebra course, EMM emphasizes the elementary functions of 
analysis: linear, quadratic, polynomial, and rational functions; 
square roots; exponentials and logarithms. These functions are the 
building blocks for the simple models that appear in first courses 
in the physical, life, and social sciences. And while EMM does not 
stress algebraic manipulation as an end in itself, it does recognize 
how important algebra is. Moreover, it provides students with the 
opportunity to see for themselves why algebra is needed, and 
what it contributes to formulating and analyzing models. 


Like the liberal arts mathematics course, EMM makes a con- 
certed effort to convey something of the scope, power, and 
fascination of mathematics, to students who may never study 
mathematics again. Each mathematical topic evolves natu- 
rally in formulating simple discrete models for inherently 
interesting contexts. For example, exponential functions 
emerge from the study of models exhibiting geometric 
growth—defined as growth that increases by equal proportions 
in equal periods of time. Throughout the course, a 
recurring theme is the evolution from simple recursive 
hypotheses (e.g., the population next year will be 10% 
greater than this year), to difference equations ($p_{n+1} = 1.1\,p_n$), 
to solutions ($p_n = p_0 (1.1)^n$), to qualitative behavior 
of models. This theme appears repeatedly as the students 
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New in the Classroom Resource Materials Series...suitable 
as a text in college algebra, finite mathematics, precalculus, 
and liberal arts mathematics courses 


encounter a series of increasingly sophisticated growth mod- 
els, starting with arithmetic growth and ending with logistic 
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